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Vaccine 

The present invention relates to methods and compositions useful in the 
treatment and prevention of human papilloma virus infections. In particular the 
invention relates to nucleic acid molecules typically encoding a polyprotein 
based on Early antigens from different HPV strains, and vectors suitable for DNA 
vaccine delivery, and pharmaceutical compositions containing them. Methods 
for manufacturing said molecules, vectors and composition are also 
contemplated, as are their use in medicine. 

Background to the Invention 



The papillomavirus virus is highly tissue and species specific. It infects basal 
epithelial cells and replicates and completes its full life cycle within the cell 
15 nucleus. Viral gene expression is tightly- linked to epithelial cell differentiation 
and capsid assembly and maturation only occurs in fully differentiated epithelial 
cells in the upper epithelial cell layers. 

The infecting human papillomavirus genotypes present in genital warts are 
20 known to be either genotype 6b or genotype 11. The majority (-90%) of genital 
\#arts are infected with HPV6b, whilst approximately 10% are infected with HPV- 
11. The primary infecting genotypes present in infections relating to cervical 
carcinoma are HPV16 and 18. 

25 Human genital warts may develop at the site of infection and they may become 
chronic, persisting for extended periods of time or, alternatively they may regress 
spontaneously resolving completely without scarring. The factors that trigger 
this regression are undefined but it is postulated that cellular response may be 
involved in the disease resolution process. 



Papillomaviruses are not naturally very immunogenic and during the course of 
natural infection antibodies may only occur very late (during or after resolution), 
and in a fraction of patients whilst some patients may resolve disease without 
developing detectable antibody at all. 



35 



WO 2004/1131222 



PCT/.EP2003/01 1.158 



2 



Vaccination using papillomavirus early antigens has been widely studied in 
several different animal model systems. However there are only a few reports 
studying therapeutic immunisation. For example, cattle immunised 
therapeutically with a cocktail of proteins comprising bovine papillomavirus 
5 (BPV) proteins E1 , E2, E4 and E7 showed a reduced papilloma disease burden 
in a proportion of animals compared to controls. 

Papilloma virus infections have been observed in a variety of species, including 
sheep, dogs, rabbits, monkeys, cattle and humans. Human papilloma viruses 

10 (HPV) have been classified into more than 80 types [Epidemiology and Biology 
of Cervical Cancer Seminars in Surgical Oncology 1999 16:203-211. Wolfgang 
MJ, Schoell MD, Janicek MF and Mirhashemi R.], some of which are further 
divided into sub-types (e.g. type 6a and 6b), based on the extent of DNA 
sequence homology. Papilloma viruses generally infect epithelia. but the 

15 different HPV types cause distinct diseases. For example, types 1-4, 7, 10 and 
. 26-29 cause benign warts, types 16, 18, 31, 33, 35, 39, 45, 51, 52, 56, 58, 59, 
and 68 are associated with cervical cancers and types 6 and 1 1 are implicated in 
genital warls (non-malignant condylomata of the genital tract). 

20 HPV has proven difficult to grow in tissue culture, so there is no traditional live or 
attenuated viral vaccine. Development of an HPV vaccine has also been slowed 
by the lack of a suitable animal model in which the human virus can be studied. 
This is because the viruses are highly species specific, so it is very difficult to 
infect an animal with a papilloma virus from a host of a different species, as 

25 would be required for safety testing before a vaccine was first tried in humans. 

Papilloma viruses have a DNA genome which encodes "early" and "late" genes 
designated E1 to E7, L1 and L2. The early gene sequences have been shown 
to have functions relating to viral DNA replication and transcription, evasion of 

30 host immunity, and alteration of the normal host cell cycle and other processes. 
For example the E1 protein is an ATP-dependent DNA helicase and is involved 
in initiation of the viral DNA replication process whilst E2 is a regulatory protein 
controlling both viral gene expression and DNA replication. Through its ability to 
bind to both E1 and the viral origin of replication, E2 brings about a local 

35 concentration of E1 at the origin, thus stimulating the initiation of viral DNA 
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replication. The E4 protein appears to have a number of poorly defined 
functions but amongst these may be binding to the host ceil cytoskeleton, whilst 
E5 appears to delay acidification of endosomes resulting in increased 
expression of EGF receptor at the cell surface and both E6 and E7 are known to 
5 bind cell proteins p53 and pRB respectively. The E6 and E7 proteins form HPV 
types associated with cervical cancer are known oncogenes. L1 and L2 encode 
the two viral structural (capsid) proteins. 

Historically, vaccines have been seen as a way to prevent infection by a 
10 pathogen, priming the immune system to recognise the pathogen and neutralise 
it should an infection occur. The vaccine includes one or more antigens from the 
pathogen, commonly the entire organism, either killed or in a weakened 
(attenuated) form, or selected antigenic peptides from the organism. When the 
immune system is exposed to the antigen(s), cells are generated which retain an 
15 immunological "memory" of it for the lifetime of the individual. Subsequent 
exposure to the same antigen (e.g. upon infection by the pathogen) stimulates a 
specific immune response which results in elimination or inactivation of the 
infectious agent. 

20 There are two arms to the immune response: a humoral (antibody) response and 
a cell-mediated response. Protein antigens derived from pathogens that replicate 
intracellularly (viruses and some bacteria) are processed within the infected host 
cell releasing short peptides which are subsequently displayed on the infected 
cell surface in association with class I major histocompatability (MHC I) 

25 molecules. When this associated complex of MHC I and peptide is contacted by 
antigen-specific CD8+ T-cells the T-cell is activated, acquiring cytotoxic activity. 
These cytotoxic T-cells (CTLs) can lyse infected host cells, so limiting the 
replication and spread of the infecting pathogen. Another important arm of the 
immune response is controlled by CD4+ T-cells. When antigen derived from 

30 pathogens is released into the extracellular milieu they may be taken up by 
specialised antigen-presenting cells (APCs) and displayed upon the surface of 
these cells in association with MHC II molecules. Recognition of antigen in this 
complex stimulates CD4+ T-cells to secrete soluble factors (cytokines) which 
regulate the effector mechanisms of other T-ceils. Antibody is produced by B- 

35 cells. Binding of antigen to secreted antibody may neutralise the infectivity of a 
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pathogen and binding of antigen to membrane-bound antibody on the surface of 
B-cells stimulates division of the B-cell so amplifying the B-cell response. In 
general, good antibody responses are required to control bacterial infections and 
both antibody and cell-mediated immune responses (CD8+ and CD4+) are 
5 required to control infections by viruses. 

It is believed that it may be possible to harness the immune system by 
vaccination, even after infection by a pathogen, to control or resolve the infection 
by inactivation or elimination of the pathogen. Such "therapeutic" vaccines 
10 would require a cell-mediated response to be effective, and would ideally invoke 
both humoral and cell-mediated immune responses. 

It has been demonstrated (Benvenisty, N and Reshaf, L. PNAS 83 9551-9555) 
that inoculation of mice with calcium phosphate precipitated DNA results in 

15 expression of the peptides encoded by the DNA. Subsequently, intramuscular 
injection into mice of plasmid DNA which had not been precipitated was shown 
to result in uptake of the DNA into the muscle cells and expression of the 
encoded protein. Because expression of the DNA results in production of the 
encoded pathogen proteins within the host's cells, as in a natural infection, this 

20 mechanism can stimulate the cell-mediated immune response required for 
therapeutic vaccination. DNA vaccines are described in WO90/11092 (Vical, 
Inc.). 

DNA vaccination may be delivered by mechanisms other than intra-muscular 
25 injection. For example, delivery into the skin takes advantage of the fact that 
immune mechanisms are highly active in tissues that are barriers to infection 
such as skin and mucous membranes. Delivery into skin could be via injection, 
via jet injector (which forces a liquid into the skin under pressure) or via particle 
bombardment, in which the DNA may be coated onto particles of sufficient 
30 density to penetrate the epithelium (US Patent No. 5371015). Projection of 
these particles into the skin results in direct transfection of both epidermal cells 
and epidermal Langerhan cells. Langerhan cells are antigen presenting cells 
(APC) which take up the DNA, express the encoded peptides, and process 
these for display on cell surface MHC proteins. Transfected Langerhan ceils 
35 migrate to the lymph nodes where they present the displayed antigen fragments 
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to lymphocytes, invoking an immune response. Very small amounts of DNA 
(0.5-1 |ig) are required to induce an immune response via particle delivery into 
skin and this contrasts with the milligram quantities of DNA known to be required 
to generate immune responses subsequent to direct intramuscular injection. 

5 

It has been reported, for example in studies using virus like particles formed from 
the L1 and L2 capsid proteins or using these proteins alone (1), that HPV is 
poorly immunogenic. Furthermore, HPV genes have proven difficult to express 
in human or other mammalian cells, leading difficulties in developing protein 

10 subunit vaccines. Monocystronic E1 has proven particularly resistant to 
expression from heterologous promoters in mammalian cells (J.Virology 1999 
73, 3062-3070. Remm M, Remm A and Mart Ustav. Human papilloma virus 
type 18 E1 is translated from polycistronic mRNA by a discontinuous scanning 
mechanism). Expression of E1 is most often detected using in vitro DNA 

15 replication of an HPV origin containing plasmid as a surrogate (Lu, JZJ, Sun et al 
J.Virol 1993 67, 7131-7139 and Del Vecchio AM et al J.Virol 1992 66, 5949- 
5958). 

International patent application WO 02/08435 provides HPV polynucleotide 
20 wherein the sequence has been optimised to resemble the usage patterns of a 
highly expressed human gene. In particular codon optimised HPV6bE1, and 
HPV 1 1 E2 are disclosed. 



Brief Description of the Invention 

25 

The present invention provides novel nucleic acid constructs which are useful in 
the prophylaxis and more particularly in the treatment of the human 
papillomaviral indured genital warts, or other HPV induced sequalae. 

.30 According to a first aspect of the present invention there is provided a nucleic 
acid construct encoding a polyprotein containing epitopes from at least two 
distinct Early antigens. Preferably the present invention provides a nucleic acid 
construct encoding a polyprotein comprising epitopes from three distinct Early 
antigens. Such construct have been shown by the present inventors to be more 

35 efficacious in animal models than the single protein approach. 



WO 2UU4/031222 



PCT/EP2003/H11158 



6 

Detailed Description 

Preferred constructs include nucleic acids coding for E2 from two different HPV 
genotypes such as HPV6b and E2 from HPV - 11. Additionally it is preferred if 
5 an E1 encoding sequence is present. Preferably the E1 is from HPV 6 or 1 1 . 

Preferred construct include a nucleic acid molecule having the following 
arrangement: 

10 1) HPV6bE1 -HPV6bE2-HPV11E2 

2) HPV6bE2 - HPV6bE1 - HPV1 1 E2 

3) HPV6bE2 - HPV11E2 - HPV6bE1 

Most preferably all the nucleic acid sequence of the above polyprotein has been 
15 codon optimised to resemble the codon usage of a highly expressed human 
gene. Preferably the E1 and E2 genes are substantially full length or more 
preferably full length. By substantially full length means at least 85% preferably 
90% of the E1 and E2 polypeptide is encoded. Surprisingly, such constructs, 
express to the equivalent expression levels as codon optimised individual 
20 proteins, and have the advantage that a single plasmid encoding the 
polyproteins is cheaper and easier to manufacture than three individual 
plasmids. 

It is preferred that these genes are codon optimised such that the codon usage 
25 pattern resembles that of actin, a highly expressed human gene product. 

The polynucleotide sequence may be a DNA sequence, for example a double 
stranded DNA sequence. Preferably the polynucleotide sequence encodes a 
HPV polypeptide of HPV type 6, 11, 16, 18, 33 or 45, most preferably type 11, 
30 sub-type 6a or sub-type 6b. In certain embodiments the encoded amino acid 
sequence is a wild-type HPV amino acid sequence. In alternative embodiments, 
the encoded amino acid sequence is a mutated HPV amino acid sequence 
comprising the wild-type sequence with amino acid changes, for example amino 
acid point mutations, sufficient to reduce or inactivate one or more of the natural 
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biological functions of the polypeptide. The mutated amino acid sequence will 
desirably retain the immunogenicity of the wild-type polypeptide. 
Proteins encoded by the polynucleotides of the invention also form an aspect of 
the present invention. 

5 

In the case of E1, the primary biological role is to initiate virus specific DNA 
replication in infected cells. It is preferred that E1 is mutated to inactivate its 
replication potential. 

10 The preferred mutations are: G 482 D 

K83G 
R84G 

Preferably two or more mutations are included. 

15 

Most preferably 3 mutations are included. 

In the case of E2, this is a site specific binding nuclear protein functioning as the 
primary replication origin recognition protein and assists in the assembly of the 
20 pre-initiation replication complex. It is preferred that the E2 protein is 
inactivated. A preferred mutation to achieve this objective is K1 1 1 A. 

According to one aspect of the present invention, the codon usage pattern of the 
polynucleotide will preferably exclude codons with an RSCU value of less than 

25 0.2 in highly expressed genes in humans. A relative synonymous codon usage 
(RSCU) value is the observed number of codons divided by the number 
expected if all codons for that amino acid were used equally frequently. A 
polynucleotide of the present invention will generally have a codon usage 
coefficient for highly expressed human genes of greater than 0.3, preferably 

30 greater than 0.4, most preferably greater than 0.5. According to a second 
aspect of the invention, an expression vector is provided which comprises and is 
capable of directing the expression of a polynucleotide sequence according to 
the invention, said polynucleotide encoding a polypeptide having epitopes from 
two or more Early antigens. The vector may be suitable. for driving expression of 
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heterologous DNA in bacterial insect or mammalian cells, particularly human 
cells. In one embodiment, the expression vector is p7313PLc. 

In a further aspect, the present invention provides a vaccine composition 
5 comprising a protein, or vector, or polynucleotide sequence of the invention. 
Preferably the vaccine composition comprises a DNA vector according to the 
present invention. In preferred embodiments the vaccine composition comprises 
a plurality of particles, preferably gold particles, coated with DNA comprising a 
vector containing a polynucleotide sequence which encodes a polypeptide 
10 having epitopes from two or more Early antigens. In alternative embodiments, 
the vaccine composition comprises a pharmaceutical^ acceptable excipient and 
a DNA vector according to the second aspect of the present invention. The 
vaccine composition may also include an adjuvant. 

15 In a further aspect, the present invention provides a method of making a vaccine 
composition including constructing a polynucleotide that encodes a polypeptide 
that has epitopes from two or more Early antigens and formulating with a 
pharmaceutical^ acceptable excipient. 

20 Also provided are the use of a polynucleotide or a vector according to the 
invention, in the treatment or prophylaxis of an HPV infection, preferably an 
infection of HPV type 6, 11, 16 or 18. The invention also provides the use of a 
polynucleotide, a vector according to the invention, in the treatment or 
prophylaxis of cutaneous (skin) warts, genital warts, atypical squamous cells of 

25 undetermined significance (ASCUS), cen/ical dysplasia, cervical intraepithelial 
neoplasia (CIN) or cervical cancer. Accordingly, the present invention also 
provides the use of a polynucleotide or of a vector according to the invention in 
making a vaccine for the treatment or prophylaxis of an HPV infection or any 
symptoms or disease associated therewith. 

30 

The present invention also provides methods of treating or preventing HPV 
infections or any symptoms or diseases associated therewith comprising 
administering an effective amount of a protein, polynucleotide or a vector or a 
vaccine according to the invention. Administration of a vaccine may take the 
35 form of one or more individual doses, for example in a "prime-boost" regime. In 
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certain cases the "prime" vaccination may be via DNA vaccine delivery, in 
particular via particle mediated DNA delivery of a polynucleotide according to the 
present invention, preferably incorporated into a plasmid-derived vector and the 
"boost" by administration of a recombinant viral vector comprising the same 
5 polynucleotide sequence. Alternatively, a protein adjuvant approach may act as 
part of the priming or boosting approach, with DNA delivered as the other arm of 
the prime-boost regime (the protein being the same as the protein encoded by 
the DNA). 

10 Throughout the present specification and the accompanying claims the words 
"comprise" and "include" and variations such as "comprises", "comprising", 
"includes" and "including" are to be interpreted inclusively. That is, these words 
are intended to convey the possible inclusion of other elements or integers not 
specifically recited, where the context allows. 

15 

The term "variant" refers to a polynucleotide which encodes the same amino 
acid sequence as another polynucleotide of the present invention but which, 
through the redundancy of the genetic code, has a different nucleotide sequence 
whilst maintaining the same codon usage pattern, for example having the same 
20 codon usage coefficient or a codon usage coefficient within 0.1 , preferably within 
0.05 of that of the other polynucleotide. 

The term "codon usage pattern" refers to the average frequencies for all codons 
in the nucleotide sequence, gene or class of genes under discussion (e.g. highly 
25 expressed mammalian genes). Codon usage patterns for mammals, including 
humans can be found in the literature (see e.g. Nakamura et.al. Nucleic Acids 
Research 1996, 24:214-215). 

In the polynucleotides of the present invention, the codon usage pattern is 
30 altered from that typical of human papilloma viruses to more closely represent 
the codon bias of a human. The "codon usage coefficient" is a measure of how 
closely the codon pattern of a given polynucleotide sequence resembles that of 
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a target species. Codon frequencies can be derived from literature sources for 
the highly expressed genes of many species (see e.g. Nakamura et.al. Nucleic 
Acids Research 1996, 24:214-215). The codon frequencies for each of the 61 
codons (expressed as the number of occurrences occurrence per 1000 codons 
5 of the selected class of genes) are normalised for each of the twenty natural 
amino acids, so that the value for the most frequently used codon for each 
amino acid is set to 1 and the frequencies for the less common codons are 
scaled to lie between zero and 1. Thus each of the 61 codons is assigned a 
value of 1 or lower for the highly expressed genes of the target species. In order 

10 to calculate a cocion usage coefficient for a specific polynucleotide, relative to 
the highly expressed genes of that species, the scaled value for each codon of 
the specific polynucleotide are noted and the geometric mean of all these values 
is taken (by dividing the sum of the natural logs of these values by the total 
number of codons and take the anti-log). The coefficient will have a value 

15 between zero and 1 and the higher the coefficient the more codons in the 
polynucleotide are frequently used codons. If a polynucleotide sequence has a 
codon usage coefficient of 1, all of the codons are "most frequent" codons for 
highly expressed genes of the target species. 

20 Shorter polynucleotide sequences are within the scope of the invention. For 
example, a polynucleotide of the invention may encode a fragment of a HPV 
protein. A polynucleotide which encodes a fragment of at least 8, for example 1- 
10 amino acids or up to 20, 50, 60, 70, 80, 100, 150 or 200 amino acids in length 
is considered to fall within the scope of the invention as long as the 

25 polynucleotide encodes a polypeptide that demonstrates HPV antigenicity. In 
particular, but not exclusively, this aspect of the invention encompasses the 
situation when the polynucleotide encodes a fragment of a complete HPV 
protein sequence and may represent one or more discrete epitopes of that 
protein. 



30 
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As discussed above, the present invention includes expression vectors that 
comprise the nucleotide sequences of the invention. Such expression vectors 
are routinely constructed in the art of molecular biology and may for example 
involve the use of plasmid DNA and appropriate initiators, promoters, enhancers 
and other elements, such as for example polyadenylation signals which may be 
necessary, and which are positioned in the correct orientation, in order to allow 
for protein expression. Other suitable vectors would be apparent to persons 
skilled in the art. By way of further example in this regard we refer to Sambrook 
et al. Molecular Cloning: a Laboratory Manual. 2 nd Edition. CSH Laboratory 
Press. (1989). 

Preferably, a polynucleotide of the invention or for use in the invention in a 
vector is operably linked to a control sequence which is capable of providing for 
the expression of the coding sequence by the host cell, i.e. the vector is an 
expression vector. The term "operably linked" refers to a juxtaposition wherein 
the components described are in a relationship permitting them to function in 
their intended manner. A regulatory sequence, such as a promoter, "operably 
linked" to a coding sequence is positioned in such a way that expression of the 
coding sequence is achieved under conditions compatible with the regulatory 
sequence. 

The vectors may be for example, plasmid, artificial chromosome, virus or phage 
vectors provided with a origin of replication, optionally a promoter for the 
expression of the said polynucleotide and optionally a regulator of the promoter. 
The vectors may contain one or more selectable marker genes, for example an 
ampicillin or kanomycin resistance gene in the case of a bacterial plasmid or a 
resistance gene for a fungal vector. Vectors may be used in vitro, for example 
for the production of DNA or RNA or used to transfect or transform a host cell, 
for example, a mammalian host cell. The vectors may also be adapted to be 
used in vivo, for example in a method of DNA vaccination or of gene therapy. 
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Promoters and other expression regulation signals may be selected to be 
compatible with the host cell for which expression is designed. For example, 
mammalian promoters include the metallothionein promoter, which can be 
induced in response to heavy metals such as cadmium, and the p-actin 
promoter. Viral promoters such as the SV40 large T antigen promoter, human 
cytomegalovirus (CMV) immediate early (IE) promoter, rous sarcoma virus LTR 
promoter, adenovirus promoter), or a HPV promoter, particularly the HPV 
upstream regulatory region (URR) may also be used. All these promoters are 
readily available in the art. 

Examples of suitable viral vectors include herpes simplex viral vectors, vaccinia 
or alpha-virus vectors and retroviruses, including lentiviruses, adenoviruses and 
adeno-associated viruses. Gene transfer techniques using these viruses are 
known to those skilled in the art. Retrovirus vectors for example may be used to 
stably integrate the polynucleotide of the invention into the host genome, 
although such recombination is not preferred. Replication-defective adenovirus 
vectors by contrast remain episomal and therefore allow transient expression. 
Vectors capable of driving expression in insect cells (for example baculovirus 
vectors), in human cells or in bacteria may be employed in order to produce 
quantities of the HPV protein encoded by the polynucleotides of the present 
invention, for example for use as subunit vaccines. Preferred viral vectors are 
those derived from non-human primate adenovirus such as C68 chimp 
adenovirus (US 6, 083, 716) otherwise known as Pan 9. 

Where the polynucleotides of the present invention find use as therapeutic 
agents, e.g. in DNA vaccination, the nucleic acid will be administered to the 
mammal e.g. human to be vaccinated. The nucleic acid, such as RNA or DNA, 
preferably DNA, is provided in the form of a vector, such as those described 
above, which may be expressed in the cells of the mammal. The polynucleotides 
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may be administered by any available technique. For example, the nucleic acid 
may be introduced by needle injection, preferably intradermal^, subcutaneously 
or intramuscularly. Alternatively, the nucleic acid may be delivered directly 
across the skin using a nucleic acid delivery device such as particle-mediated 
5 DNA delivery (PMDD). In this method, inert particles (such as gold beads) are 
coated with a nucleic acid, and are accelerated at speeds sufficient to enable 
them to penetrate a surface of a recipient (e.g. skin), for example by means of 
discharge under high pressure from a projecting device. (Particles coated with a 
nucleic acid molecule of the present invention are within the scope of the present 
10 invention, as are devices loaded with such particles). 

Suitable techniques for introducing the naked polynucleotide or vector into a 
patient include topical application with an appropriate vehicle. The nucleic acid 
may be administered topically to the skin, or to mucosal surfaces for example by 

15 intranasal, oral, intravaginal or intrarectal administration. The naked 
polynucleotide or vector may be present together with a pharmaceutical^ 
acceptable excipient, such as phosphate buffered saline (PBS). DNA uptake 
may be further facilitated by addition of facilitating agents such as bupivacaine to 
the composition. Other methods of administering the nucleic acid directly to a 

20 recipient include ultrasound, electrical stimulation, electroporation and 
microseeding which is described in US-5,697,901. 

Uptake of nucleic acid constructs may be enhanced by several known 
transfection techniques, for example those including the use of transfection 

25 agents. Examples of these agents includes cationic agents, for example, 
calcium phosphate and DEAE-Dextran and lipofectants, for example, lipofectam 
and transfectam. The dosage of the nucleic acid to be administered can be 
altered. Typically the nucleic acid is administered in an amount in the range of 
1pg to 1mg, preferably to 1pg to 10jag nucleic acid for particle mediated gene 

30 delivery and 1 0jig to 1 mg for other routes. 
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A nucleic acid sequence of the present invention may also be administered by 
means of specialised delivery vectors useful in gene therapy. Gene therapy 
approaches are discussed for example by Verme et a/, Nature 1997, 389:239- 
242. Both viral and non-viral systems can be used. Viral based systems include 
retroviral, lentiviral, adenoviral, adeno-associated viral, herpes viral, Canarypox 
and vaccinia-viral based systems. Non-viral based systems include direct 
administration of nucleic acids and liposome-based systems. 

A nucleic acid sequence of the present invention may also be administered- by 
means of transformed cells. Such cells include cells harvested from a subject. 
The naked polynucleotide or vector of the present invention can be introduced 
into such cells in vitro and the transformed cells can later be returned to the 
subject. The polynucleotide of the invention may integrate into nucleic acid 
already present in a cell by homologous recombination events. A transformed 
cell may, if desired, be grown up in vitro and one or more of the resultant cells 
may be used in the present invention. Cells can be : provided at an appropriate 
site in a patient by known surgical or microsurgical techniques (e.g. grafting, 
micro-injection, etc.) 

The vaccine compositions of the present invention may include adjuvant 
compounds which may serve to increase the immune response induced by the 
protein itself or which is encoded by the plasmid DNA. Alteration of the codon 
bias to suit the vaccinated species is proposed herein as a means of increasing 
expression and thereby boosting the immune response, but an adjuvant may 
never-the-less be desirable because, while DNA vaccines tend to work well in 
mice models, there is evidence of a somewhat weaker potency in larger species 
such as non-human primates which is thought to be predictive, of the likely 
potency in humans. 

The vaccine composition of the invention may also comprise an adjuvant, such 
as, for example, in an embodiment, imiquimod, tucaresol or alum. 
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Preferably (he adjuvant is administered at the same time as of the invention and 
in preferred embodiments are formulated together. Such adjuvant agents 
contemplated by the invention include, but this list is by no means exhaustive 
and does not preclude other agents: synthetic imidazoquinolines such as 
5 imiquimod [S-26308, R-837], (Harrison, et al. 'Reduction of recurrent HSV 
disease using imiquimod alone or combined with a glycoprotein vaccine', 
Vaccine 19: 1820-1826, (2001)); and resiquimod [S-28463, R-848] (Vasilakos, et 
al. 1 Adjuvant activates of immune response modifier R-848: Comparison with 
CpG ODN\ Cellular immunology 204: 64-74 (2000).), Schiff bases of carbonyls 

10 and amines that are constitutively expressed on antigen presenting cell and T- 
cell surfaces, such as tucaresol (Rhodes, J. et al. ' Therapeutic potentiation of 
the immune system by costimulatory Schiff-base-forming drugs', Nature 377: 71- 
75 (1995)), cytokine, chemokine and co-stimulatory molecules, Th1 inducers 
such as interferon gamma, IL-2, IL-12, IL-15 and IL-18, Th2 inducers such as IL- 

15 4, IL-5, IL-6, IL-10 and IL-13 and other chemokine and co-stimulatory genes 
such as MCP-1 , MIP-1 alpha, MIP-1 beta, RANTES, TCA-3, CD80, CD86 and 
CD40L, other immunostimulatory targeting ligands such as CTLA-4 and L- 
selectin, apoptosis stimulating proteins and peptides such as Fas, (49), synthetic 
lipid based adjuvants, such as vaxfectin, (Reyes et al., Vaxfectin enhances 

20 antigen specific antibody titres and maintains Th1 type immune responses to 
plasmid DNA immunization', Vaccine 19: 3778-3786) squalene, alpha- 
tocopherol, polysorbate 80, DOPC and cholesterol, endotoxin, [LPS], Beutler, 
B., 'Endotoxin, Toll-like receptor 4, and the afferent limb of innate immunity', 
Current Opinion in Microbiology 3: 23-30 (2000)) ; CpG oligo- and di-nucleotides, 

25 Sato, Y. et al., Immunostimulatory DNA sequences necessary for effective 

intradermal gene immunization', Science 273 (5273): 352-354 (1996). Hemmi, 
H. et al., 'A Toll-like receptor recognizes bacterial DNA', Nature 408: 740-745, 
(2000) and other potential ligands that trigger Toll receptors to produce Th1- 
inducing cytokines, such as synthetic Mycobacterial lipoproteins, Mycobacterial 

30 protein p19, peptidoglycan, teichoic acid and lipid A. 
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Certain preferred adjuvants for eliciting a predominantly Th1 -type response 
include, for example, a Lipid A derivative such as monophosphoryl lipid A, or 
preferably 3-de-O-acylated monophosphoryl lipid A. MPL® adjuvants are 
5 available from Corixa Corporation (Seattle, WA; see, for example, US Patent 
Nos. 4,436,727; 4,877,61 1; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Th1 response. Such oligonucleotides are well known and are 
described, for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 
10 6,008,200 and 5,856,462. Immunostimulatory DNA sequences are also 
described, for example, by Sato et al., Science 273:352, 1996. Another 
preferred adjuvant comprises a saponin, such as Quil A, or derivatives thereof, 
including QS21 and QS7 (Aquila Biopharmaceuticals Inc., Framingham, MA); 
Escin; Digitonin; or Gypsophila or Chenopodium quinoa saponins. 

15 

In an embodiment, the adjuvant comprises an immunostimulatory CpG 
oligonucleotide, such as disclosed in (WO96102555). Typical 
immunostimulatory oligonucleotides will be between 8-100 bases in length and 
comprises the general formula Xi CpGX 2 where Xi and X 2 are nucleotide bases, 
20 and the C and G are unmethylated. 

The preferred oligonucleotides for use in adjuvants or vaccines of the present 
invention preferably contain two or more dinucleotide CpG motifs preferably 
separated by at least three, more preferably at least six or more nucleotides. 

25 The oligonucleotides of the present invention are typically deoxynucleotides. In 
a preferred embodiment the internucleotide in the oligonucleotide is 
phosphorodilhioate, or more preferably a phosphorothioate bond, although 
phosphodiester and other internucleotide bonds are within the scope of the 
invention including oligonucleotides with mixed internucleotide linkages, e.g. 

30 mixed phosphorothioate/phophodiesters. Other internucleotide bonds which 
stabilise the oligonucleotide may be used. Methods for producing 
phosphorothioate oligonucleotides or phosphorodithioate are described in 
US5,666,153, US5,278,302 and WO95/26204. 
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Examples of preferred oligonucleotides have the following sequences. The 
sequences preferably contain phosphorothioate modified internucleotide 
linkages. 

5 

OLIGO 1 : TCC ATG ACG TTC CTG ACG TT (CpG 1826) (SEQ ID NO 24) 
OLIGO 2: TCT CCC AGC GTG CGC CAT (CpG 1758) (SEQ ID NO 25) 
OLIGO 3: ACC GAT GAC GTC GCC GGT GAC GGC ACC ACG (SEQ ID NO 26) 
OLIGO 4: TCG TCG TTT TGT CGT TTT GTC GTT (CpG 2006) (SEQ ID NO 27) 
1 0 OLIGO 5: TCC ATG ACG TTC CTG ATG CT (CpG 1 668) (SEQ ID NO 28) 

Alternative CpG oligonucleotides may comprise the preferred sequences above 
in that they have inconsequential deletions or additions thereto. 

15 The CpG oligonucleotides utilised in the present invention may be synthesized 
by any method Known in the art (eg EP 468520). Conveniently, such 
oligonucleotides may be synthesized utilising an automated synthesizer. An 
adjuvant formulation containing CpG oligonucleotide can be purchased from 
Qiagen under the trade name "ImmunEasy". 
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The following Examples serve to further illustrate the invention, with reference to 
the accompanying drawings, in which: 

Fig 1 is a schematic view of HPV Immunotherapeutic vaccine construct of the 
5 invention. 

. Fig 2 is a plasmid map of P70776be2 - encoding HPV 6b E2 that has been 
codon optimised and mutated. . 

10 Fig 3 is a plasmid map of p73p1c6be1 - encoding HPV 6b E1 that has been 
codon optimised and mutated. 

Fig 4 is a plasmid map of p70771 1e2 - encoding HPV 1 1 E2 that has been 
codon optimised and mutated. 

15 

Fig 5 is a plasmid map of HPV 102 -encoding HPV 11 E2 in p7313 background. 

Fig 6 is a plasmid map of HPV 104 - fusion of E2 from HPV 6b and E2 from 

HPV 11 in p7313 background. 

20 

Fig 7is a plasmid map of HPV 105 - fusion of codon optimised, mutated HPV 6b 
E2 and E2fromHPV11. 

Fig 8 is a plasmid map of HPV 108 - HPV 6b E1 codon optimised, mutated in 
25 p7313 background. 

Fig 9 is a plasmid map of HPV 1 10 - HPV 6b E2 codon optimised, mutated in 
p7313 background. 

30 Fig 1 0 is a plasmid map of HPV 1 1 6 - HPV 6b E1 , HPV 6b E2, HPV 1 1 E2. 
Fig 1 1 is a plasmid map of HPV 1 1 7 - HPV 6b E2, HPV 1 1 E2, HPV 6b E1 . 
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Fig 12 is a plasmid map of HPV 118 - HPV 6b E2, HPV 11 E2, HPV 6b E1. 

Fig 13 is u western blot analysis of three polyprotein constructs of the invention 
in 293 T cells. 

Fig 14 shows the incapacity of KIIIA mutated E2 in an invitro CAT transcriptional 
reporter assay. 

Fig 15 shuws cellular immune response in mice to E1 
Fig 16 shows ceilular immune response to E2 
Fig 17 - CTL assay data with HPV 118 after PMID 

Fig 18 shows reduction of warts after administration of E1/E2 in the COPV 
model. 



1. Plasmid: pWRG7077 6be2 c/o mutated 

Gene of interest: 

The HPV6be2 gene is approximately 1.1Kb in size and a codon optimised 
sequence (for human expression) was created using a visual basic programme 
called Syngene. In addition the sequence included a codon change at amino 
acid position 1 1 1 , whereby a lysine residue (AAG) in the wild type was changed 
to an alanine residue (GCA) creating a mutated gene. This change inactivates 
the transcriptional activity of 6be2. Overlapping primers incorporating the whole 
gene with selected restriction sites at both the 5* and 3' ends were designed 
accordingly. 

Cloning: 

The 1.1 kb PCR fragment was gel purified and digested with restriction enzymes 
Not J and Bam HI for ligation into vector pWRG7077 (Powderject). The gene is 
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under control of the full immediate early CMV promoter and have a bovine 
growth hormone poly A tail. 

Clones were sequenced indicated a number of base errors. A number of suitable 
clones were identified to enable construction of the correct gene sequence by 
using restriction digests. From re-cloning, one clone C7 was found to have only 
one base error at position 497 (T to C). Other clones were o.k. in this area and a 
simple fragment swap was just needed to correct the error. The final clone C7a 
was confirmed to be codon optimised mutated 6be2. (See Fig. 2) 

Sbe2 sequence in pWRG7077 (Sequence ID No. 1) 

ATGGAAGCTATTGCCAAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGTACGA 

GGAAAAC'AGCACAGACCTCCACAAGCACGTGCTGCACTGGAAGTGCATGCGCCACGAGT 

CAGTGCTCCTGTACAAGGCCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTG 

CCCCCGCTGAAGGTGAGCGAAGCCAAGGGCCACAACGCTATCGAGATGCAGATGCACCT 

GGAGAGCCTGCTGCGGACCGAATACAGCATGGAGCCCTGGACTCTCCAGGAGACGTCCT 

ACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCGCAAAGCGCGGCAAGACAGTTGAG 

GTGAAATTCGA .• GGGTGCGCAAACAACACGATGGACTACGTGGTGTGGACCGATGTCTA 

CGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCAAGGGCATCT 

ATTACACCTGCGGGCAGTTCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAG 

TATGGTTCCACCAAGCACTGGGAGGTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGC 

CTCCGTGTCGTCCACCACCCAGGAAGTGAGCATTCCGGAGAGCACCACATACACCCCGG 

CCCAAACGAGCACGCTCGTCAGCAGCAGCACCAAGGAGGACGCCGTCCAGACGCCCCCC 

CGGAAGAGGGCCCGGGGGGTCCAGCAGTCTCCCTGCAATGCCCTGTGCGTTGCTCACAT 

CGGCCCTGTCGATTCTGGGAACCACAATCTCATCACGAACAACCACGACCAGCACCAAA 

GGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGTTCCAGGGGGAGTCCAAC 

TGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTTCGACTTGAT 

CAGTTCCACGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGG 

TGACCTACGACTCCGAGGAGCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCG 

ACAATCAGCCACAAGCTTGGCTTCATGTCCCTGCACCTGCTGTGA 

Amino acid sequence (Seq. ID No. 2) 

MEAIAKRLDA CQEQLLELYE ENSTDLHKHV LHWKCMRHES VLLYKAKQMG LSHIGMQWP 
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PLKVSEAKGH KAIEMQMHLE SLLRTEYSME PWTLQETSYE MWQTPPKRCF AKRGKTVEVK 
FDGCANNTMD YVVWTDVYVQ DNDTWKVHS MVDAKGIYYT CGQFKTYYVN FVKEAEKYGS 
TKHWEVCYGS TVICSPASVS STTQEVSIPE STTYTPAQTS TLVSSSTKED AVQTPPRKRA 
RGVQQSPCWA LCVAHIGPVD SGNHNLITNN HDQHQRRNNS NSSATPIVQF QGESNCLKCF 
RYRLNDRHRH LFDLISSTWH WASSKAPHKH AIVTVTYDSE EQRQQFLDW KIPPTISHKL 
GFMSLHLL 

2. Plasmid: p7313plc 6be1 c/o mut 
Gene of interest: 

The HPV6be1 gene is approximately 2Kb in size and a codon optimised wild 
type (wt) sequence (for E.coli and human expression) was created using a 
statistical visual basic programme called Syngene. Overlapping primers 
incorporating the whole gene with selected restriction sites at both the 5' and 3' 
ends were designed accordingly. The synthesised gene was then digested with 
Bam HI and Not I restriction enzymes for ligation into vector pCIN4. From the 
sequencing data for a number of selected clones, numerous base errors were 
discovered. A correct clone was generated by combiriing a correct Pst l-Bam HI 
fragment from clone #24 and a Not l-Pst I fragment from clone #21 into p7313- 
plc. A correct done (#1) was confirmed by sequencing. For mutagenesis 
primers were designed to change the following amino acids; lysine (AAA) to 
glycine (GGA) at position 83, arginine (CGC) to glycine (GGC) at position 84 and 
glycine (GGC) to asparagine(GAC) at position 482. 

Gbel codon optimised mutated sequence 
(Seq ID No. 3) 

ATGGCAGACGATTCCGGTACTGAGAACGAAGGTTCTGGTTGTACCGGTTGGTTCATGGT 
TGAAGCAATCGTTCAGCATCCGACTGGTACCCAGATCTCCGATGACGAAGACGAAGAAG 
TTGAAGATTCTGGTTACGACATGGTTGACTTCATCGATGACTCCAACATCACTCATAAC 
TCTCTGGAAGCACAGGCTCTGTTTAACCGCCAGGAAGCTGATACCCATTACGCTACTGT 
TCAGGACCTGGGAGGCAAATATCTGGGCTCTCCGTACGTTTCCCCGATCAACACTATCG 
CAGAAGCAGTTGAGTCTGAAATCTCCCCGCGCCTGGACGCTATCAAACTGACTCGTCAG 
CCGAAGAAGGTTAAACGTCGTCTGTTCCAGACTCGTGAACTGACCGACTCCGGTTACGG 
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TTATAGCGAAGTTGAGGCTGGCACCGGCACCCAGGTTGAAAAACACGGTGTACCGGAAA 

ACGGCGGCGACGGTCAGGAAAAGGACACCGGCCGCGACATCGAGGGTGAGGAACACACC 

GAAGCTGAAGCTCCGACTAACTCTGTTCGTGAACACGCAGGTACTGCGGGTATCCTGGA 

ACTGCTGAAATGCAAAGACCTGCGCGCGGCTCTGCTGGGCAAATTCAAAGAATGCTTCG 

GCCTGTCTTTCATTGACCTGATCCGTCCGTTTAAGTCTGACAAAACTACCTGTCTGGAC 

TGGGTTGTAGCAGGCTTCGGCATCCACCACTCTATCTCTGAAGCATTCCAGAAACTGAT 

CGAGCCGCTGTCTCTGTACGCGCACATCCAGTGGCTGACTAACGCTTGGGGTATGGTTC 

TGCTGGTACTGCTGCGCTTTAAAGTAAACAAATCTCGTTCCACTGTTGCTCGTACTCTG 

GCTACCCTGCTGAACATCCCGGAGAACCAGATGCTGATCGAACCGCCGAAAATCCAGTC 

TGGTGTAGCTGCACTGTACTGGTTTCGTACTGGCATCTCTAACGCTAGCACTGTTATCG 

GTGAAGCACCGGAATGGATCACTCGTCAGACCGTTATCGAACACGGTCTGGCAGATTCT 

CAGTTCAAACTGACTGAAATGGTTCAGTGGGCATACGACAACGACATCTGCGAGGAATC 

TGAAATTGCGTTCGAATACGCTCAGCGTGGCGACTTCGACTCCAACGCTCGTGCTTTCC 

TGAACAGCAA.CATGCAGGCTAAATACGTAAAAGACTGCGCTACCATGTGCCGTCACTAC 

AAACACGCGGAAATGCGTAAAATGTCTATCAAACAGTGGATCAAGCACCGCGGTTCTAA 

AATCGAAGGTACCGGTAACTGGAAACCGATCGTTCAGTTCCTGCGCCATCAGAACATCG 

AATTCAT'CCCGTTCCTGACCAAATTCAAGCTGTGGCTGCACGGTACCCCGAAAAAAAAC 

TGCATCGCTATCGTAGGTCCACCGGACACTGACAAGTCTTACTTCTGTATGTCCCTGAT 

CTCTTTCCTGGGCGGCACTGTAATCTCTCACGTTAACTCTTCCTCCCATTTCTGGCTGC 

AGCCACTGGTAGACGCGAAAGTAGCTCTGCTGGACGACGCGACCCAGCCGTGCTGGATC 

TACATGGATACTTACATGCGCAACCTGCTGGACGGTAACCCGATGTCTATCGACCGTAA 

ACACAAAGCGCTGACTCTGATCAAGTGCCCGCCGCTGCTGGTAACTTCTAACATCGACA 

TCACCAAGGAAGATAAATACAAGTACCTGCATACCCGTGTTACTACCTTTACTTTCCCG 

AACCCGTTCCCGTTTGATCGTAACGGTAACGCTGTTTACGAACTGTCCAACACTAACTG 

GAAATGCTTCTTCGAGCGTCTGTCTTCCTCCCTGGACATCCAGGACTCTGAAGATGAAG 

AAGATGGTTCTAACTCTCAGGCTTTCCGTTGTGTTCCGGGTACTGTTGTTCGTACTCTG 

TGA 

Amino acid sequence (Seq. ID No. 4) 

MADDSC-TENE GSGCTGWFMV EAIVQHPTGT QISDDEDEEV EDSGYDMVDF 
IDDSNITHNS LEAQALFNRQ EADTHYATVQ DLGGKYLGSP YVSPINTIAE 
AVESEISPRL, DAI KLTRQPK KVKRRLFQTR ELTDSGYGYS EVEAGTGTQV 
EKHGVPENGG DGQEKDTGRD IEGEEHTEAE APTNSVREHA GTAGILELLK 
CKDLRAALLG KFKECFGLSF IDLIRPFKSD KTTCLDWVA GFGIHHSISE 
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AFQKLIBPLS LYAHIQWLTN AWGMVLLVLL RFKVNKSRST VARTLATLLN 
IPENQMLIEP PKIQSGVAAL YWFRTGISNA STVIGEAPEW ITRQTVIEHG 
LADSQFKLTE MVQWAYDNDI CEESEI'AFEY AQRGDFDSNA RAFLNSNMQA 
KYVKDCATMC RHYKHAEMRK MSIKQWIKHR GSKIEGTGNW KPIVQFLRHQ 
NIEFIPFLTK FKLWLHGTPK KNCIAIVGPP DTDKSYFCMS LISFLGGTVI 
SHVNSSSHFW LQPLVDAKVA LLDDATQPCW IYMDTYMRNL LDGNPMSIDR 
KHKALTLIKC PPLLVTSNID ITKEDKYKYL HTRVTTFTFP NPFPFDRNGN 
AVYELSNTNW KCFFERLSSS LDIQDSEDEE DGSNSQAFRC VPGTWRTL 

3. Plasmid: WRG7077 11e2 c/o mut 
Gene of interest: 

The HPV11e2 gene is approximately 1.1Kb in size and a codon optimised 
sequence (for human expression) was created using a visual basic programme 
called Syngene. In addition the sequence included a codon change at amino 
acid position 111, whereby a lysine residue (AAG) in the wild type was changed 
to an alanine residue (GCC) creating a mutated gene. This change has been 
shown in the literature to inactivate the transcriptional activity of the E2 protein. 
Overlapping primers incorporating the whole gene with selected restriction sites 
at both the 5* and 3' ends were designed accordingly, and were used to 
assemble the synthetic codon optimised mutant 11 e2. 

Cloning: 

The 1.2kb PCR fragment was gel purified and digested with restriction enzymes 
Not I and Bam HI for ligation into vector pWRG7077 (Powderject). The gene is 
under control of the full immediately early CMV promoter and has a bovine 
growth hormone poly A tail. 

Clones that were sequenced had indicated a number of base errors, these were 
subsequently corrected. A final clone F1 was found to be codon optimised 
mutated 11E2. 

Ile2 sequence in pWRG7077 (Seq. ID No. 5) 
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ATGGAAGCCATCGCGAAGAGGCTCGACGCCTGCCAGGACCAGCTGCTCGAGCTGTACGA 
GGAGAACAGCATTGACATCCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGA 
GCGTGCTGTTGCACAAGGCCAAGCAGATGGGCCTGTCCCACATAGGCCTTCAGGTGGTC 
CCCCCTCTGACCGTGTCAGAGACAAAGGGCCATAACGCAATCGAGATGCAGATGCACCT 
CGAGTCGCTGGCGAAAACACAGTACGGCGTGGAGCCATGGACCCTGCAGGACACCTCGT 
ACGAAATGTGGCTGACCCCACCTAAGCGATGCTTCGCCAAACAGGGCAACACAGTGGAG 
GTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTATGTCGTGTGGACGCACATCTA 
TCTGCAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACGCGAAGGGCATCT 
ACTATACCTGTGGGCAGTTTAAAACCTACTATGTGAACTTCAACAAAGAGGCCCAAAAG 
TATGGCTCCAC'CAACCACTGGGAGGTCTGCTATGGGAGCACGGTGATTTGCTCTCCCGC 
CAGCGTGTCTAGCACTGTGCGCGAGGTGAGCATTGCCGAGCCGACCACGTACACCCCTG 
CCCAGACGACCGCTCCGACCGTGTCTGCTTGTACTACCGAGGACGGCGTGAGCGCTCCA 
CCCAGGAAGCGTGCGAGGGGCCCAAGCACCAACAACACCCTCTGTGTGGCGAACATTCG 
CAGCGTCGACAGTACCATCAATAACATCGTGACGGATAACTATAACAAGCACCAGAGGC 
GTAACAACTGTCACTCTGCCGCAACCCCCATCGTGCAGCTCCAGGGAGACAGCAATTGC 
•CTTAAGTGCTTCCGCTATCGCCTCAACGACAAGTACAAGCACCTCTTTGAGCTCGCCTC 
GTCGACGTGGCACTGGGCCTCACCCGAGGCACCTCACAAGAACGCCATCGTCACTCTCA 
CTTACTCCAGTGAGGAGCAGAGACAGCAGTTTCTGAACAGCGTGAAGATCCCACCGACG 
ATCCGTCATAAGGTCGGCTTCATGTCACTGCATCTCCTGTGA 

Amino acid sequence (Seq. ID No. 6) 

MEAIAKRLDA CQDQLLELYE ENSIDIHKHI MHWKCIRLES VLLHKAKQMG LSHIGLQWP 
PLTVSETKGH NAIEMQMHLE SLAKTQYGVE PWTLQDTSYE MWLTPPKRCF AKQGNTVEVK 
FDGCEDMVME YVVWTHIYLQ DNDSWVKVTS SVDAKGIYYT CGQFKTYYVN FNKEAQKYGS 
TNHVJEVCYGS TVICSPASVS STVREVSIAE PTTYTPAQTT APTVSACTTE DGVSAPPRKR 
ARGPSTHNT1 CVANIRSVDS TINNIVTDNY NKHQRRNNCH SAATPIVQLQ GDSNCLKCFR 
YRLNDKYKBL FELASSTWHW ASPEAPHKNA IVTLTYSSEE QRQQFLNSVK IPPTIRHKVG 
FMSLHLL 

4. Plasmid: HPV102 (p7313me 1 1e2 c/o mut) 



Gene of interest: 
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Codon optimised mutated 11e2 was transferred from pWRG7077 11e2 do mut 
into another expression vector p7313me. 

Cloning; 

The 11e2 do mut fragment was cut out of pWRG7077 11e2 vector by Bam HI 
and Not I restriction enzymes. This fragment was then ligated into p7313me 
vector using these sites. 

Ile2 sequence in HPV102 (Seq. ID No. 7) 

ATGGAAGCCATCGCGAAGAGGCTCGACGCCTGCCAGGACCAGCTGCTCGAGCTGTACGA 

GGAGAACAGCATTGACATCCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGA 

GCGTGCTGTTGCACAAGGCCAAGCAGATGGGCCTGTCCCACATAGGCCTTCAGGTGGTC 

CCCCCTCTGACCGTGTCAGAGACAAAGGGCCATAACGCAATCGAGATGCAGATGCACCT 

CGAGTCGCTGGCGAAAACACAGTACGGCGTGGAGCCATGGACCCTGCAGGACACCTCGT 

ACGAAATGTGGCTGACCCCACCTAAGCGATGCTTCGCCAAACAGGGCAACACAGTGGAG 

GTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTATGTCGTGTGGACGCACATCTA 

TCTGCAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACGCGAAGGGCATCT 

ACTATACCTGTGGGCAGTTTAAAACCTACTATGTGAACTTCAACAAAGAGGCCCAAAAG 

TATGGCTCCACCAACCACTGGGAGGTCTGCTATGGGAGCACGGTGATTTGCTCTCCCGC 

CAGCGTGTCTAGCACTGTGCGCGAGGTGAGCATTGCCGAGCCGACCACGTACACCCCTG 

CCCAGACGACCGCTCCGACCGTGTCTGCTTGTACTACCGAGGACGGCGTGAGCGCTCCA 

CCCAGGAAGCGTGCGAGGGGCCCAAGCACCAACAACACCCTCTGTGTGGCGAACATTCG 

CAGCGTCGACAGTACCATCAATAACATCGTGACGGATAACTATAACAAGCACCAGAGGC 

GTAACAACTGTCACTCTGCCGCAACCCCCATCGTGCAGCTCCAGGGAGACAGCAATTGC 

CTTAAGTGCTTCCGCTATCGCCTCAACGACAAGTACAAGCACCTCTTTGAGCTCGCCTC 

GTCGACGTGGCACTGGGCCTCACCCGAGGCACCTCACAAGAACGCCATCGTCACTCTCA 

CTTACTCCAGTGAGGAGCAGAGACAGCAGTTTCTGAACAGCGTGAAGATeCCACCGACG 

ATCCGTCATAAGGTCGGCTTCATGTCACTGCATCTCCTGTGA 

Amino acid sequence (Seq. ID No. 8) 

MEAIAKRLDA CQDQLLELYE ENSIDIHKHI MHWKCIRLES VLLHKAKQMG LSHIGLQWP 
PLTVSETKGH MAIEMQMHLE SLAKTQYGVE PWTLQDTSYE MWLTPPKRCF AKQGNTVEVK 
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10 



15 



FDGCEDNVME YWWTHIYLQ DNDSWVKVTS SVDAKGIYYT CGQFKTYYVN FNKEAQKYGS 
TNHWEVCYGS TVICSPASVS STVREVSIAE PTTYTPAQTT APTVSACTTE DGVSAPPRKR 
ARGPSTNNTL CVANIRSVDS TINNIVTDNY NKHQRRNNCH SAATPIVQLQ GDSNCLKCFR 
YRLNDKYKHL FELASSTWHW ASPEAPHKNA IVTLTYSSEE QRQQFLNSVK IPPTIRHKVG 
FMSLHLL 

5. Plasmid: HPV104 (p7313me 6b/11e2c/o mut) 
Gene of interest: 

A fusion protein of 6be2 and 1 1 e2 was constructed using 2x PCR with HPV1 02 
and HP V1 1 0 as templates and appropriate designed primers. The fusion 
fragment ~2.2kb was cloned into p7313me expression vector with the 6be2 at 

the beginning of the fusion protein. 

Cloning: 



The 2.2kb fusion was digested with Bam Hi and Not I restriction enzymes and 
ligated into P 7313me expression vector. Isolated clones were checked by 
20 sequencing and indicated no errors had been incorporated 

6b/11e2 fusion sequence in HPV104 (Seq. ID No. 9) 

ATGGAAGCTATTGCCAAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGTACGAGGAAAACAG 
25 CACAGACCTCCAC^GCACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTGCTCCTGTACAAGG 
CCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTGCCCCCGCTGAAGGTGAGCGAAGCCAAG 
GGCCACAACGCTATCGAGATGCAGATGCACCTGGAGAGCCTGCTGCGGACCGAATACAGCATGGAGCC 
CTGGACTCTCCAGGAGACGTCCTACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCGCAAAGCGCG 
GCAAGACAGTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGTGGTGTGGACCGAT 
30 GTCTACGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCAAGGGCATCTATTA 
CACCTGCGGGCAGTXCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAGTATGGTTCCACCA 
AGCACTGGGAGGTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTCGTCCACCACCCAG 
GAAGTGAGCATTCCGGAGAGCACCACATACACCCCGGCCCAAACGAGCACGCTCGTCAGCAGCAGCAC 
CAAGGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCAGTCTCCCTGCAATG 
35 CCCTGTGCGTTGCTCACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATCACGAACAACCACGAC 
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CAGCACCAAAGGCGCAACAACTCTAAC^GCTCCG 

CTGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTTCGACTTGATCAGTTCCA 
CGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGGTGACCTACGACTCCGAG 
GAGCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCGACAATCAGCCACAAGCTTGGCTTCAT 
5 GTCCCTGCACCTGCTGATGGAAGCCATCGCGAAGAGGCTCGACGCCTGCCAGGACCAGCTGCTCGAGC 
TGTACGAGGAGAACAGCATTGACATCCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGAGC 
GTGCTGTTGCACAAGGCCAAGCAGATGGGCCTGTCCCACATAGGCCTTCAGGTGGTCCCCCCTCTGAC 
CGTGT CAGAG A CAAAG GG CCATAACGCAATCGAGATG C AGATG CAC CT CGAGTCGCTGGCGAAAACAC 
AGTACGGCGTGGAGCCATGGACCCTGCAGGACACCTCGTACGAAATGTGGCTGACCCCACCTAAGCGA 

10 TGCTTCGC CAAAC AGGG CAACAC AGTGGAGGTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTA 

TGTCGTGTGGACGCACATCTATCTGCAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACG 
CGAAGGGCATCTACTATACCTGTGGGCAGTTTAAAACCTACTATGTGAACTTCAACAAAGAGGCCCAA 
AAGTATGGCTCCACCAACCACTGGGAGGTCTGCTATGGGAGCACGGTGATTTGCTCTCCCGCCAGCGT 
GTCTAGCACTGTGCGCGAGGTGAGCATTGCCGAGCCGACCACGTACACCCCTGCCCAGACGACCGCTC 

15 CGACCGTGTCTGCTTGTACTACCGAGGACGGCGTGAGCGCTCCACCCAGGAAGCGTGCGAGGGGCCCA 
AGCACCAACAACACCCTCTGTGTGGCGAACATTCGCAGCGTCGACAGTACCATCAATAACATCGTGAC 
GGATAACTATAACAAGCACCAGAGGCGTAACAACTGTCACTCTGCCGCAACCCCCATCGTGCAGCTCC 
AGGGAGACAGCAATTGCCTTAAGTGCTTCCGCTATCGCCTCAACGACAAGTACAAGCACCTCTTTGAG 
CTCGCCTC 1 GTCOACGTGGCACTGGGCCTCACCCGAGGCACCTCACAAGAACGCCATCGTCACTCTCAC 

20 TTACT C C AGTG AGG AG CAGAGAC AGCAGTTTCTGAAC AGCGTGAAGATCCC AC CGACGATCCGTCAT A 

AGGTCGGCTTCATGTCACTGCATCTCCTGA 

Amino acid sequence (Seq. ID No. 10) 

25 M E A I AKRIjD A CQ3QLLELYE EN S TDLHKHV LHWKCMRHES VLIjYKAKQMG LSHIGMQWP 

PLKVSSAKGH NAIEMQMHLE SLLRTEYSME PWTLQETSYE MWQTPPKRCF AKRGKTVEVK 
FDGCAtfNTMD YVVWTDVYVQ DNDTWVKVHS MVDAKGIYYT CGQFKTYYYN FVKEAEKYGS 
TKHVJEVCYGS TVICSPASVS STTQEVSIPE STTYTPAQTS TLVSSSTKED AVQTPPRKRA 
RGVQQSPCNA LCVAHIGPVD SGNHNLITNN HDQHQRRNNS NSSATPIVQF QGESNCLKCF 

30 RYRLNDRHRH LFDLXSSTWH WASSKAPHKH AIVTVTYDSE EQRQQFLDW KIPPTISHKL 

GFMSLHLLME AIAKRLDACQ DQLLELYEEN SIDIHKHIMH WKCIRLESVL LHKAKQMGLS 
HIGLQWPPL TVSETKGHNA IEMQMHLESL. AKTQYGVEPW TLQDTSYEMW LTPPKRCFAK 
QGNTVEVKFD GCEDNVMEYV VWTH I YXjQDM DSWVKVTSSV DAKGIYTTCG QFKTYYVNFN 
KEAQKYGSTW HV/EVCYGSTV ICSPASVSST VREVSIAEPT TYTPAQTTAP TVSACTTEDG 

35 VSAPPRKRAR GPSTNNTLCV ANIRSVDSTI NWIVTDNYNK HQRRNNCHSA ATPIVQLQGD 
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SNCLKCFRYR LNDKYKHLFE LASSTWHWAS PEAPHKNAIV TLTYSSEEQR QQFLNSVKIP 
PTIRHKVGFM SLHLL 

6. Plasmid: HPV105 (p7313me 11/6be2 c/o mut) 

5 

Gene of interest: 

A fusion protein of 6be2 and 1 1e2 was constructed using 2x PCR with HPV102 
and HPV1 10 as templates and appropriate designed primers. The fusion 
10 fragment ~2.2kb was cloned into p7313me expression vector and with the 1 1 e2 
at the beginning of the fusion protein. 

Cloning: 

15 The 2.2kb fusion was digested with Bam HI and Not I restriction enzymes and 
ligated into p7313me expression vector. Isolated clones were checked by 
sequencing and indicated no errors had been incorporated. 

Il/6bi32 fusion sequence in HPV105 (Seq. ID No. 11) 

20 

ATGGAAGCCATCGCGAAGAGGCTCGACGCCTGCCAGGACCAGCTGCTCGAGCTGTACGAGGAGAACAG 
CATTGACATCCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGAGCGTGCTGTTGCACAAGG 
CCAAGCAGATGGGCCTGTCCCACATAGGCCTTCAGGTGGTCCCCCCTCTGACCGTGTCAGAGACAAAG 
GGCCATAACGCAATCGAGATGCAGATGCACCTCGAGTCGCTGGCGAAAACACAGTACGGCGTGGAGC^ 

25 ATGGACCCTGCAGGACACCTCGTACGAAATGTGGCTGACCCCACCTAAGCGATGCTTCGCCAAACAGG 
GCAACACAGTGGAGGTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTATGTCGTGTGGACGCAC 
ATCTATCTGCAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACGCGAAGGGCATCTACTA 
TACCTGTGGGCAGTTTMAACCTACTATGTGAACTTCAACAAAGAGGCCCAAAAGTATGGCTCCACCA 
ACCACTGGGAGGTCTGCTATGGGAGCACGGTGATTTGCTCTCCCGCCAGCGTGTCTAGCACTGTGCGC 

30 GAGGTGAGCATTGCCGAGCCGACCACGTACACCCCTGCCCAGACGACCGCTCCGACCGTGTCTGCTTG 
TACTACCGAGGACGGCGTGAGCGCTCCACCCAGGAAGCGTGCGAGGGGCCCAAGCACCAACAACACCC 
TCTGTGTGGCGAACATTCGCAGCGTCGACAGTACCATCAATAACATCGTGACGGATAACTATAACAAG 
CACCAGAGGCGTAACAACTGTCACTCTGCCGCAACCCCCATCGTGCAGCTCCAGGGAGACAGCAATTG 
CCTTAAGTGCTTCCGCTATCGCCTCAACGACAAGTACAAGCACCTCTTTGAGCTCGCCTCX5TCGACGT 

35 GG C A CTG GG C CT C A C C CG AG G C AC CT CAC AAG AACG C CAT C GT CACTGT CACT TACT CCAGTG AG GAG 

CAGAGACAGCAGTTTCTGAACAGCGTGAAGATCCCACCGACGATCCGTCATAAGGTCGGCTTCATGTC 
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ACTGCATCTCCTGATGGAAGCTATTGCCAAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGT 
ACGAGGAJ^CAGCACAGACCTCCACAAGCACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTG 
CTCCTGTACAAGGCCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTGCCCCCGCTGAAGGT 
GAGCGAAGCCAAGGGCCACAACGCTATCGAGATGCAGATGCACCTGGAGAGCCTGCTGCGGACCGAAT 
AC AG CATGG AGC C CTGG ACTCT CC AGGAGACGTCCTACGAAATGTGGCAGACT CCTCCGAAGCGCTGT' 
TTCGCAAAGCGCGGCAAGACA.GTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGT 
GGTGTGGACCGATGTCTACGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCA 
AGGGCATCTATTACACCTGCGGGCAGTTCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAG 
TATGGTTCCACCAAGCACTGGGAGGTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTC 
GTCCACCACCCAGGAAGTGAGCATTCCGGAGAGCACCACATACACCCCGGCCCAAACGAGCACGCTCG 
TCAGCAGCAGCACCAA.GGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCAG 
TCTCCCTGCAATGCCCTGTGCGTTGCTCACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATCAC 
GAACAACCACGACCAGCACCAAAGGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGTTGC 
AGGGGGAGTCC/AACTGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTTCGAC 
TTGATCAGTTCCACGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGGTGAC 
CTACGACTCCGAV'G>\GCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCGACAATCAGCCACA 

AGCTTGGCTTCATGTCCCTGCACCTGCTGA 

Amino acid sequence (Seq. ID No. 12) 

MEAIAKRLDA CQDQLLELYE ENSIDIHKHI MHWKCIRLES VLLHKAKQMG LSHIGLQWP 
PLTVSETKGH MAIBMQMHLE SLAKTQYGVE PWTLQDTSYE MWLTPPKRCF AKQGNTVEVK 
FDGCEDNVME YWWTHIYM DNDSWVKVTS SVDAKGIYYT CGQFKTYYVN FNKEAQKYGS 
TNHWEVCYGS TVICSPASVS STVREVSIAE PTTYTPAQTT APTVSACTTE DGVSAPPRKR 
ARGPSTNNTL CVANTRSVDS TINNIVTDNY NKHQRRNNCH SAATPIVQLQ GDSNCLKCFR 
YRLNDKYKHL FELASSTWHW ASPEAPHKNA IVTLTYSSEE QRQQFLNSVK IPPTIRHKVG 
FMSLHLLMEA IAKRLDACQE QLLELYEENS TDLHKHVLHW KCMRHESVLL YKAKQMGLSH 
IGMQVVPPLK VSEAKGHNAI EMQMHLESLL RTEYSMEPWT LQETSYEMWQ TPPKRCFAKR 
GKTVEVKFDG CANWT MD YW WTDVYVQDND TWVKVHSMVD AKGIYYTCGQ FKTYYVNFVK 
EAEKYGSTKH WEVCYGSTVI CSPASVSSTT QEVSIPESTT YTPAQTSTLV SSSTKEDAVQ 
TPPRKRARGV QQSPCNALCV AHIGPVDSGN HNLITWNHDQ HQRRNNSNSS ATPIVQFQGE 
SNCLKCFRYR LNDRHRHLFD LISSTWHWAS SKAPHKHAIV TVTYDSEEQR QQFLDWKIP 
PTISHICLGFM SLHLL 



7. Plasmid: HPV108 (p7313ie 6be1 c/o mut) 
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Gene of interest: 

Codon optimised mutated 6be1 was transfered from p7313plc 6be1 c/o mut 
clone N into vector p7313ie. 

Cloning: 

The 6be1 c/o mut fragment was cut out of the p7313plc 6be2 clone by Not I and 
Bam HI restriction digests. This fragment was then ligated into p7313ie vector 
using these 

sites. The gene is under the control of the ie promoter (immediate early cmv + 
exonl) and followed by a rabbit b-gtobin poly-ad enylation signal. 

6be1 sequence in p7313ie (Seq. ID No. 13) 

ATGGCAGACGATTCCGGTACTGAGAACGAAGGTTCTGGTTGTACCGGTTGGTTCATGGTTGAAGCAAT 

CGTT C AGC ATC CG ACTGGT AC C CAG ATCTC CGATGACGAAGACGAAGAAGTTGAAGATTCTGGTTACG 

ACATGGTTGACTTCATCGATGACTCCAACATCACTCATAACTCTCTGGAAGCACAGGCTCTGTTTAAC 

CGCCAUGA/vUCTGATACCCATTACGCTACTGTTCAGGACCTGGGAGGCAAATATCTGGGCTCTCCGTA 

CGTTTCCCCGA'JX'AAC^^CTATCGCAGAAGCAGTTGAGTCTGAAATCTCCCCGCGCCTGGACGCTATCA 

AACTGACTCGTCAGCCGAAGAAGGTTAAACGTCGTCTGTTCCAGACTCGTGAACtGACCGACTCCGGT 

TACGGTTATAGCGAAGTTGAGGCTGGCACCGGCACCCAGGTTGAAAAACACGGTGTACCGGAAAACGG 

CGGCGACGGTCAGGAAAAGGACACCGGCCGCGACATCGAGGGTGAGGAACACACCGAAGCTGAAGCTC 

CGACTAACTCTGTTCGTGAACACGCAGGTACTGCGGGTATCCTGGAACTGCTGAAATGCAAAGACCTG 

CGCGCGGCTCl'GCTGGGCAAATTCAAAGAATGCTTCGGCCTGTCTTTCATTGACCTGATCCGTCCGTT 

TAAGTCTGACAAAACTACCTGTCTGGACTGGGTTGTAGCAGGCTTCGGCATCCACCACTCTATCTCTG 

AAGCATTCCAGAAACTGATCGAGCCGCTGTCTCTGTACGCGCACATCCAGTGGCTGACTAACGCTTGG 

GGTATGGTTCTGCTGGTACTGCTGCGCTTTAAAGTAAACAAATCTCGTTCCACTGTTGCTCGTACTCT 

GGCTACCCTGCTGAACATCCCGGAGAACCAGATGCTGATCGAACCGCCGAAAATCCAGTCTGGTGTAG 

CTGCACTGTACTGGTTTCGTACTGGCATCTCTAACGCTAGCACTGTTATCGGTGAAGCACCGGAATGG 

ATCACTCGTCAGACCGTTATCGAACACGGTCTGGCAGATTCTCAGTTCAAACTGACTGAAATGGTTCA 

GTGGGCATACGACAACGACATCTGCGAGGAATCTGAAATTGCGTTCGAATACGCTCAGCGTGGCGACT 

TCGACTCCAACGCTCGTGCTTTCCTGAACAGCAACATGCAGGCTAAATACGTAAAAGACTGCGCTACC 
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ATGTGCCGTCACTACAAACACGCGGAAATGCGTAAAATGTCTATCAAACAGTGGATCAAGCACCGCGG 
TTCTAAAATCGAAGGTACCGGTAACTGGAAACCGATCGTTCAGTTCCTGCGCCATCAGAACATCGAAT 
TCATCCCGTTCCTGACCAAATTCAAGCTGTGGCTGCACGGTACCCCGAAAAAAAACTGCATCGCTATC 
GTAGGTCCACCGGACACTGACAAGTCTTACTTCTGTATGTCCCTGATCTCTTTCCTGGGCGGCACTGT 
AATCTCTCACGTTA^CTCTTCCTCCCATTTCTGGCTGCAGCCACTGGTAGACGCGAAAGTAGCTCTGC 

TGGACGACG CG ACCC AGCCGTGCTGGAT CTAC ATGGATACTTACATGCGCAAC CTGCTGGACGGT AAC 
CCGATGTCTATCGACCGTAAACACAAAGCGCTGACTCTGATCAAGTGCCCGCCGCTGCTGGTAACTTC 
TAACATCGACATCACCAAGGAAGATAAATACAAGTACCTGCATACCCGTGTTACTACCTTTACTTTCC 
CGAACCCGl'TCCCGTTTGATCGTAACGGTAACGCTGTTTACGAACTGTCCAACACTAACTGGAAATGC 
TTCTTCGAGCGTCTGTCTTCCTCCCTGGACATCCAGGACTCTGAAGATGAAGAAGATGGTTCTAACTC 
TCAGGCTTTCCGTTGTGTTCCGGGTACTGTTGTTCX3TACTCTGTGA 

Amino acid sequence (Seq. ID No. 14) 

MADDSGTENE GSGCTGVJ FMV EAIVQHPTGT QISDDEDEEV EDSGYDMVDF IDDSNITHNS 
LEAQALFNRQ EADTHYATVQ DLGGKYLGSP YVSPINTIAE AVESEISPRL DAIKLTRQPK 
KVKRRLFQTR ELTDSGYGYS EVEAGTGTQV EKHGVPENGG DGQEKDTGRD IEGEEHTEAE 
APTNSVREHA GTAGILELLK CKDLRAALLG KFKECFGLSF IDLIRPFKSD KTTCLDWWA 
GFGIHHSISE AFQKLIEPLS LYAHIQWLTN AWGMVLLVLL RFKVNKSRST VARTLATLLN 
IPENQMLIEP PKIQSGVAAL YWFRTGISNA STVIGEAPEV7 ITRQTVIEHG LADSQFKLTE 
MVQWAYDNDI CEESEIAFEY AQRGDFDSNA RAFLNSNMQA KYVKD CATMC RHYKHAEMRK 
MSIKQWIKHR GSKIEGTGNW KPIVQFLRHQ NIEFIPFLTK FKLWLHGTPK KNCIAIVGPP 
. DTDKSYFCMS LISFLGGTVI SHVNSSSHFW LQPLVDAKVA LLDDATQPCW I YMDT YMRNXj 
LDGNPMSIDR KHKALTLIKC PPLLVTSNID ITKEDKYKYL HTRVTTFTFP NPFPFDRNGN 
AVYELSOTNW KCFFERLSSS LDIQDSEDEE DGSNSQAFRC VPGTWRTL 

8. Plasmid: HPV110 (p7313ie 6be2 c/o mut) 
Gene of interest: 

Codon optimised mutated 6be2 was transferred from pWRG7077 6be2 into 

vector p7313ie. 



Cloning: 
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The 6be2 c/o mut fragment was cut out of pWRG7077 6be2 clone by Not I and 
Bam hi resulctlon digests. This fragment was then ligated into p7313ie vector 
using these sites. The gene is under the control of the ie promoter (immediate 
early cmv + exonl) and followed by a rabbit £>-globin poly-adenylation signal. 

5 

6be2 sequence in p7313ie (Seq. ID No. 15) 

ATGGAAGCTATTGCCAAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGTACGAGGAAAACAG 
CAC AGAC CTC C A CAAG CACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTGCTCCTGTACAAGG 

10 CCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTGCCCCCGCTGAAGGXGAGCGAAGCCAAG 
GGCCACAACGCTATCGAGATGCAGATGCACCTGGAGAGCCTGCTGCGGACCGAATACAGCATGGAGCC 
CTGGACTCTCCAGGAGACGTCCTACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCGCAAAGCGCG 
GCAAGACAGTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGTGGTGTGGACCGAT 
GTCTACGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCAAGGGCATCTATTA 

15 CACCTGCGGGCAGTTCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAGTATGGTTCCACCA 
AGCACTGGGAGGTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTCGTCCACCACCCAG 
GAAGTGAGCATTCCGGAGAGCACCACATACACCCCGGCCCAAACGAGCACGCTCGTCAGCAGCAGCAC 
CAAGGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCAGTCTCCCTGCAATG 
CCCTGTGC^TTCKU'CACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATCACGAACAACCACGAC 

20 CAGCACCAAAGGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGTTCCAGGGGGAGTCCAA 
CTGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTTCGACTTGATCAGTTCCA 
CGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGGTGACCTACGACTCCGAG 
GAGCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCGACAATCAGCCACAAGCTTGGCTTCAT 

GTCCCTGCACCTGCTGTGA 

25 

Axaino acid sequence (Seq, ID No. 16) 

MEAIAKRLDA CQEQLLELYE ENSTDLHKHV LHWKCMRHES VLLYKAKQMG LSHIGMQWP 
PLKVSEAKGH NA I EMQMHLE SLLRTEYSME PWTLQETSYE MWQTPPKRCF- AKRGKTVEVK 
30 FDGCANMTMD YWWTDVYVQ DNDTWVKVHS MVDAKGIYYT CGQFKTYYVN FVKEAEKYGS 

TKHWEVCYGS 'IV X C3 PASVS STTQEVSIPE STTYTPAQTS TLVSSSTKED AVQTPPRKRA 
RGVQQSPCNA LCVAHIGPVD SGNHNLITNN HDQHQRRNNS NSSATPIVQF QGESNCLKCF 
RYRLNDRHRH LFDLISSTWH WASSKAPHKH AIVTVTYDSE EQRQQFLDW KIPPTISHKL 
GFMSLHXjL 

35 
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9. Piasmio: HPV'i16 (p7313ie Gbe1.6be2.11e2) 
Gene of interest: 

5 The gene for the polyprotein in construct HPV116 is a triple fusion protein 
comprised in order of 6be1 f 6be2, 11e2 ail codon optimised and mutated. The 
polyprotein gene was assembled by PCR from using 2 previous PGR fragments; 
6be1 and Bb/11e2. The size of the gene is ~4.1kb, producing a polyprotein of 
~170kD,observed by PAGE and Western blot. 

10 



Cloning: 

15 The polyprotein gene was digested with Bam HI + Not I restriction enzymes and 
ligated into p7313ie vector. Sequencing analysis of selected clones had 
indicated the 'odd' base change, but this was overcome by various fragment 
swapping. A resulting clone hpv116 #1 was found to have no errors. 

20 

Polyprotein sequence in HPV116 (Seq. ID No. 17) 

ATGGCAGACGATTCCGGTACTGAGAACGAAGGTTCTGGTTGTACCGGTTGGTTCATGGTTGAAGCAA 
T CGTTCAGCAX CCG A CTG GTAC CC AGATCTC CGATGACGAAGACGAAGAAGTTGAAGATTCTGGTTA 

25 CGACATGGTTGACTTCATCGATGACTCCAACATCACTCATAACTCTCTGGAAGCACAGGCTCTGTTT 
AACCGCCAGGAAGCTGATACCCATTACGCTACTGTTCAGGACCTGGGAGGCAAATATCTGGGCTCTC 
CGTACGTTTCCCCGATCAACACTATCGCAGAAGCAGTTGAGTCTGAAATCTCCCCGCGCCTGGACGC 
TATCAAACTGACTCGTCAGCCGAAGAAGGTTAAACGTCGTCTGTTCCAGACTCGTGAACTGACCGAC 
TCCGGTTACGGTTATAGCGAAGTTGAGGCTGGCACCGGCACCCAGGTTGAAAAACACGGTGTACCGG 

30 AAAACGGCGG CGACGGTCAGGAAAAGGACACCGGCCGCGACATCGAGGGTGAGGAACACAC CGAAG C 

TGAAGCTCCGACTAACTCTGTTCGTGAACACGCAGGTACTGCGGGTATCCTGGAACTGCTGAAATGC 
AAAGACCTGCGCGCGGCTCTGCTGGGCAAATTCAAAGAATGCTTCGGCCTGTCTTTCATTGACCTGA 
TCCGTCCGTTTAAGTCTGACAAAACTACCTGTCTGGACTGGGTTGTAGCAGGCTTCGGCATCCACCA 
CTCrATCTCTGAAGCATTCCAGAAACTGATCGAGCCGCTGTCTCTGTACGCGCACATCCAGTGGCTG 

35 ACTAACGCTTGGGGTATGGTTCTGCTGGTACTGCTGCGCTTTAAAGTAAACAAATCTCGTTCCACTG 
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TTGCTCGTACTCTGGCTACCCTGCTGAACATCCCGGAGAACCAGATGCTGATCGAACCGCCGAAAAT 

CCAGTCTGGTGTAGCTGCACTGTACTGGTTTCGTACTGGCATCTCTAACGCTAGCACTGTTATCGGT 

GAAGCACCGGA^TGGATCACTCGTCAGACCGTTATCGAACACGGTCTGGCAGATTCTCAGTTCAAAC 

TGACTGAAATGGTTCAGTGGGCATACGACAACGACATCTGCGAGGAATCTGAAATTGCGTTCGAATA 

CGCTCAGCGTGGCGACTTCGACTCCAACGCTCGTGCTTTCCTGAACAGCAACATGCAGGCTAAATAC 

GTAAAAGACTGCGCTACCATGTGCCGTCACTACAAACACGCGGAAATGCGTAAAATGTCTATCAAAC 

AGTGGATCAAGCACCGCGGTTCTAAAATCGAAGGTACCGGTAACTGGAAACCGATCGTTCAGTTCCT 

GCGCCATCAGAACATCGAATTCATCCCGTTCCTGACCAAATTCAAGCTGTGGCTGCACGGTACCCCG 

AAAAAAAACTGCATCGCTATCGTAGGTCCACCGGACACTGACAAGTCTTACTTCTGTATGTCCCTGA 

TCTCTTTCCTGGGCGGCACTGTAATCTCTCACGTTAACTCTTCCTCCCATTTCTGGCTGCAGCCACT 

GGTAGACCC;UA]>J\GTAGCTCTGCTGGACGACGCGACCCAGCCGTGCTGGATCTACATGGATACTTAC 

ATGCGCA^CCTGCTGGACGGTAACCCGATGTCTATCGACCGTAAACACAAAGCGCTGACTCTGATCA 

AGTGCCCGCCGCTGCTGGTAACTTCTAACATCGACATCACCAAGGAAGATAAATACAAGTACCTGCA 

TACCCGTGTTACTACCTTTACTTTCCCGAACCCGTTCCCGTTTGATCGTAACGGTAACGCTGTTTAC 

GAACTGTCCAACACTAACTGGAAATGCTTCTTCGAGCGTCTGTCTTCCTCCCTGGACATCCAGGACT 

CTGAAGATGAAGAAGATGGTTCTAACTCTCAGGCTTTCCGTTGTGTTCCGGGTACTGTTGTTCGTAC 

TCTGATGGAAGCTATTGCC'AAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGTACGAGGAA 

AACAGCACAGACCTCCACAAGCACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTGCTCCTGT 

ACAAGGCCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTGCCCCCGCTGAAGGTGAGCGA 

AGCCAAGGGCCACAACGCTATCGAGATGCAGATGCACCTGGAGAGCCTGCTGCGGACCGAATACAGC 

ATGGAGCCCTGGACTCTCCAGGAGACGTCCTACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCG 

CAAAGCGCGGCAAGACAGTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGTGGT 

GTGGACCGATGTCTACGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCAAG 

GGCATCTATTACACCTGCGGGCAGTTCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAGT 

ATGGTTCCACCAAGCACTGGGAGGTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTC 

GTCCACCACCCAGGAAGTGAGCATTCCGGAGAGACCACATACACCCCGGCCCAAACGAGCACGCTCG 

T^GCAGCAGCAGCAAGGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCA 

GTCTCCCTGCAAXGCCCTGTGCGTTGCTCACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATC 

ACGAACAACCACGACCAGCACCAAAGGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGT 

TCCAGGGGGAGTCCAACTGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTT 

CGACTTGATCAGTTCCACGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACG 

GTGACCTACGACTCCGAGGAGCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCGACAATCA 

GCCACAAGCTTGGCTTCATGTCCCTGCACCTGCTGATGGAAGCCATCGCGAAGAGGCTCGACGCCTG 

CCAGGACCAGCTGCTCGAGCTGTACGAGGAGAACAGCATTGACATCCATAAGCACATCATGCACTGG 

AAGTGCATTCGCCTGGAGAGCGTGCTGTTGCACAAGGCCAAGCAGATGGGCCTGTCCCACATAGGCC 
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TTC AGGTGGT C C CC C CTCTG ACCGTGTCAGAGACAAAGGGCCATAACGCAATCGAG ATGCAGATGCA 
CCTCGAGTCGCTGGCGAAAACACAGTACGGCGTGGAGCCATGGACCCTGCAGGACACCTCGTACGAA 
ATGTGGCTGACCCCACCTAAGCGATGCTTCGCCAAACAGGGCAACACAGTGGAGGTGAAGTTCGACG 
GCTGTGAG rjiVj 1 .CCiTTATGGAGTATGTCGTGTGGACGCACATCTATCTGCAGGACAACGACAGTTG 
GGTGAAGGTGACCAGCTCCGTGGACG CGAAGGGC AT CTACTATACCTGTGGGCAGTTTAAAAC CTAC 
TA.TGTGAACTTCAACAAAGAGGCCCAAAAGTATGGCTCCACCAACCACTGGGAGGTCTGCTATGGGA 
GCACGGTGATTTGCTCTCCCGCCAGCGTGTCTAGCACTGTGCGCGAGGTGAGCATTGCCGAGCCGAC 
C ACGTACACC CCTG CCC AG ACGACCGCTCCGACCGTGTCTGCTTGTACTACCGAGGACGGCGTGAGC 
GCTCCACCCAGGAAGCGTGCGAGGGGCCCAAGCACCAACAACACCCTCTGTGTGGCGAACATTCGCA 
GCGTCGACAGTACXV^TCAATAAGATCGTGACGGATAACTATAACAAGCACGAGAGGCGTAACAACTG 
TCACTCTGCCGC^CCCCCATCGTGCAGCTCCAGGGAGACAGCAATTGCCTTAAGTGCTTCCGCTAT 
CGCCTCAACGACAAGTACAAGCACCTCTTTGAGCTCGCCTCGTCGACGTGGCACTGGGCCTCACCCG 
AGGCACCTCACAAGAACGCCATCGTCACTCTCACTTACTCCAGTGAGGAGCAGAGACAGCAGTTTCT 
GAACAGCGTGAAGATCCCACCGACGATCCGTCATAAGGTCGGCTTCATGTCACTGCATCTCCTGTGA 

Amino siciJ .ence (Seq. ID No. 18) 

MADDSGTENE GSGCTGWFMV EAIVQHPTGT QISDDEDEEV EDSGYDMVDF IDDSNITHNS 
LEAQALFNRQ EADTHYATVQ DLGGKYLGSP YVSPINTIAE AVESEISPRL DAIKLTRQPK 
KVKRRLFQTR SLTDSGYGYS EVEAGTGTQV EKHGVPENGG DGQEKDTGRD IEGEEHTEAE 
A PTNS VR E HA GTAGILELLK CKDLRAALLG KFKECFGLSF IDLIRPFKSD KTTCLDWWA 
GFGIHHSISE AFQKLIEPLS LYAHIQWLTN AWGMVLLVLL RFKVNKSRST VARTLATLLN 
IPENQMLIEP PKIQSGVAAL YWFRTGI SNA STVIGEAPEW ITRQTVIEHG LADSQFKLTE 
MVQWAYDNDI CEESEIAFEY AQRGDFDSNA RAFLNSNMQA KYVKD CATMC RHYKHAEMRK 
MSIKQWIKHR GSKIEGTGNW KPIVQFLRHQ NIEFIPFLTK FKLWLHGTPK KNCIAIVGPP 
DTDKSYFCMS LI SFLGGTVI SHVWSSSHFW LQPLVDAKVA LLDDATQPCW IYMDTYMRNL 
LDGNPMSIDR KUKALTLIKC PPLLVTSNID ITKEDKYKYL HTRVTTFTFP NPFPFDRUGN 
AVYELSNTNW KCFFERLSSS LDIQDSEDEE DGSNSQAFRC VPGTWRTLM EAIAKRLDAC 
QEQLLELYEE NSTDLHKHVL HWKCMRHESV LLYKAKQMGL SHIGMQWPP LKVSEAKGHN 
AIEMQMHLES LLRTSYSMEP WTLQBTSYEM WQTPPKRCFA KRGKTVEVKF DGCANNTMDY 
WWTDVYVQD NDTWVKVHSM VDAKGIYYTC GQFKTYYVNF VKEAEKYGST KHWEVCYGST 
VICSPASVSS TTQEVSIPES TTYTPAQTST LVSSSTKEDA VQTPPRKRAR GVQQSPCNAL 
CVAHIGPVDS GWHNLITNNH DQHQRRNNSN SSATPIVQFQ GESNCLKCFR YRLNDRHRHL 
PDLISSTWHW AS S KAPH KHA IVTVTYDSEE QRQQFLDWK IRPTISHKLG FMSLHLLMEA 
IAKRLDACQD QLLELYEENS IDIHKHIMHW KCIRIjESVLIj HKAKQMGLSH IGLQWPPLT 
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VSETKGHNAI EMQMHLESLA KTQYGVEPWT LQDTSYEMWL TPPKRCFAKQ GNTVEVKFDG 
CEDNVMEYW WTHIYLQDND SWVKVTSSVD AKGIYYTCGQ FKTYYVNFNK EAQKYGSTNH 
WEVCYGSTVi OSPASVSSTV REVSIAEPTT YTPAQTTAPT VSACTTEDGV SAPPRKRARG 
PSTNNTLCVA NIRSVDSTIN NIVTDWYNKH QRRNNCHSAA TPIVQLQGDS NCLKCFRYRL 
5 NDKYKHLFEL ASSTWHWASP EAPHKNAIVT LTYSSEEQRQ QFLNSVKIPP TIRHKVGFMS 

LHLL 

10. Plasmid: HPV117 (p7313ie 6be2.6be1.11e2) 
10 Gene of interest: 

The gene for the polyprotein in construct HPV117 is a triple fusion protein 
comprised in order of 6be2, 6be1, 11e2 all codon optimised and mutated. The 
polyprotein gene was assembled by PCR from using 3 previous PCR fragments; 
15 6be1 ana 6be2 and11e2. The size of the gene is ~4.1kb, producing a polyprotein 
of ~170kD, observed by PAGE and. Western blot. 

Cloning: 

20 The polyprotein gene was digested with Bam HI + Not I restriction enzymes and 
ligated into p7313ie vector. Sequencing analysis of selected clones had 
indicated the 'odd 1 base change, but this was overcome by various fragment 
swapping. A resulting clone hpvl 17 #6 was found to have no errors. 

25 

Polyprotein sequence in HPV117 (Seq. ID No. 19) 

ATGGAAG CTATTG C C AAGCGACTGGACG CCTG CCAGGAGCAGCTG CTGGAGCTGTACGAGGAAAACAG 
30 CACAGACCTCCACAAGCACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTGCTCCTGTACAAGG 
C C AAG C AG ATGGGG C T GT C C C AC AT CGG GATG CAGGTCGTG C C C CCG CTGAAG GT GAG CGAAG CC AAG 
GGCC AC AACG CTK'I CG AG A.TGC AGATG CACCTGG AGAGCCTGCTGCGG ACCGAATACAGCATGG AGC C 
CTGGACTCTCCAGGAGACGTCCTACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCGCAAAGCGCG 
GCAAGACAGTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGTGGTG0X3GACCGAT 
35 GTCT ACGTG C AGGACAATG ACA CCTGGGTGAAGGTACAT AGTATGGTGGATG CCAAGGGCATC TATTA 
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CACCTQCQGGCAGTTCRAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAGTATGGTTCCACCA 

AGCACTGGGAGCTGTGTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTCGTCCACCACCCAG 

GAAGTGAGCATTCCGGAGAGCACCACATACACCCCGGCCCAAACGAGCACGCTCGTCAGCAGCAGCAC 

CAAGGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCAGTCTCCCTGCAATG 

CCCTGTGCGTTGCTCACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATCACGAACAACCACGAC 

CAGCACCAAAGGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGTTCCAGGGGGAGTCCAA 

CTGCCTCAAGTGTTTCCC-CTACCGCCTCAACGACCGCCACCGCCACCTGTTCGACTTGATCAGTTCCA 

CGTGGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGGTGACCTACGACTCCGAG 

GAGCAGAGGCAGCAGTTCCTGGACGTCGTGAAGATTCCTCCGACAATCAGCCACAAGCTTGGCTTCAT 

GTCCCTGCACCTGCTGATGGCAGACGATTCCGGTACTGAGAACGAAGGTTCTGGTTGTACCGGTTGGT 

TCATGGTTGAAGCAATCGTTCAGCATCCGACTGGTACCCAGATCTCCGATGACGAAGACGAAGAAGTT 

GAAGATTCTGGTTACGACATGGTTGACTTCATCGATGACTCCAACATCACTCATAACTCTCTGGAAGC 

ACAGGCTCTGTTTAACCGCCAGGAAGCTGATACCCATTACGCTACTGTTCAGGACCTGGGAGGCAAAT 

ATCTGGGCTCTCCGTACGTTTCCCCGATCAACACTATCGCAGAAGCAGTTGAGTCTGAAATCTCCCCG 

CGCCTGGACGC'fATCAAACTGACTCGTCAGCCGAAGAAGGTTAAACGTCGTCTGTTCCAGACTCGTGA 

ACTGACCGACTCCGGTTACGGTTATAGCGAAGTTGAGGCTGGCACCGGCACCCAGGTTGAAAAACACG 

GTGTACCGGAAAACGGCGGCGACGGTCAGGAAAAGGACACCGGCCGCGACATCGAGGGTGAGGAACAC 

ACCGAAGCTGAAGCTCCGACTAACTCTGTTCGTGAACACGCAGGTACTGCGGGTATCCTGGAACTGCT 

GAAATGCAAAGACCTGCGCGCGGCTCTGCTGGGCAAATTCAAAGAATGCTTCGGCCTGTCTTTCATTG 

ACCTGATCCGTCCGTTTAAGTCTGACAAAACTACCTGTCTGGACTGGGTTGTAGCAGGCTTCGGCATC 

CACCACTCTATCTCl'GAAGCATTCCAGAAACTGATCGAGCCGCTGTCTCTGTACGCGCACATCCAGTG 

GCTGA.CTAACGCT'rGGGGTATGGTTCTGCTGGTACTGCTGCGCTTTAAAGTAAACAAATCTCGTTCCA 

CTGTTGCTCGTACTCTGGCTACCCTGCTGAACATCCCGGAGAACCAGATGCTGATCGAACCGCCGAAA 

ATCCAGTCTGGTGTAGCTGCACTGTACTGGTTTCGTACTGGCATCTCTAACGCTAGCACTGTTATCGG 

TGAAGC'ACCOGA.'.a'GGATCACTCGTCAGACCGTTATCGAACACGGTCTGGCAGATTCTCAGTTCAAAC 

TGACTGAAATGGTTCAGTGGGCATACGACAACGACATCTGCGAGGAATCTGAAATTGCGTTCGAATAC 

GCTCAGCGTGGCGACTTCGACTCCAACGCXCGTGCTTTCCTGAACAGCAACATGCAGGCTAAATACGT 

AAAAGACTGCGCTACC'ATGTGCCGTCACTACAAACACGCGGAAATGCGTAAAATGTCTATCAAACAGT 

GGATCAAGCACCGCGGTTCTAAAATCGAAGGTACCGGTAACTGGAAACCGATCGTTCAGTTCCTGCGC 

CATCAGAACATCGAATTCATCCCGTTCCTGACCAAATTCAAGCTGTGGCTGCACGGTACCCCGAAAAA 

nAAC'TGC'A'rC.'GC'!'A'4.'CGTAGGTCCACCGGACACTGACAAGTCTTACTTCTGTATGTCCCTGATCTCTT 

TCCTGGGCGGCACTGTAATCTCTCACGTTAACTCTTGCTCCCATTTCTGGCTGCAGCCACTGGTAGAC 

GCGAAAGTAGCTCTGCTGGACGACGCGACCCAGCCGTGCTGGATCTACATGGATACTTACATGCGCAA 

CCTGCTGGACGGTAACCCGATGTCTATCGACCGTAAACACAAAGCGCTGACTCTGATCAAGTGCCCGC 

cgctgctggtaacttct;aacatcgacatcaccaaggaagataaatacaagtacctgcatacccgtgtt 
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ACTACCTTXACTa^CCCGAACCCGTTCCCGTTTGATCGTAACGGTAACGCTGTTTACGAACTGTCCAA 
CACTAACTGGAAATGCTTCTTCGAGCGTCTGTCTTCCTCCCTGGACATCCAGGACTCTGAAGATGAAG 
AAGATGGTTCTAACTCTCAGGCTTTCCGTTGTGTTCCGGGTACTGTTGTTCGTACTCTGATGGAAGCC 
AT CG CG AAGAGG CT GGACGCCTG C CAGGACCAGCTGCTCGAGCTGTACG AGGAGAAC AGCATTGACAT 
CCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGAGCGTGCTGTTGCACAAGGCCAAGCAGA 
TGGGCCTGTCCCACATAGGCCTTCAGGTGGTCCCCCCTCTGACCGTGTCAGAGACAAAGGGCCATAAC 
GCAATCGAGATGCAGATGCACCTCGAGTCGCTGGCGAAAACACAGTACGGCGTGGAGCCATGGACCCT 
GCAGGACACCTCGTACGAAATGTGGCTGACCCGACCTAAGCGATGCTTCGCCAAACAGGGCAACACAG 
TGGAGGTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTATGTCGTGTGGACGCACATCTATCTG 
CAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACGCGAAGGGCATCTACTATACCTGTGG 
GCAGTTTAAAACCTACTATGTGAACTTCAACAAAGAGGCCCAAAAGTATGGCTCCACCAACCACTGGG 

aggtctgctatgggagcacggtgatttgctctcccgccagcgtgtctagcactgtgcgcgaggtgagc 
attgccg:^ccgp,ccacgtacacccctgcccagacgaccgctccgaccgtgtctgcttgtactaccga 

GGACGGCGTGAGCGCTCCACCCAGGAAGCGTGCGAGGGGCCCAAGGACCAACAACACCCTCTGTGTGG 
CGAACATTCGCAGCGTCGACAGTACCATCAATAAGATCGTGACGGATAACTATAACAAGCACCAGAGG 

C GT AAC AACTGT C A CT CTGC C G C AAC C C CC ATCGTGC AG CT CCAGGG AGAC AG CAATTG C CTT AAGTG 
CTTCCGCTATCGCCTCAACGACAAGTACAAGCACCTCTTTGAGCTCGCCTCGTCGACGTGGCACTGGG 
CCTCACCCGAGGIC^CCTCACAAGAACGCCATCGTCACTCTCACTTACTCCAGTGAGGAGCAGAGACAG 
CAGT'rTCTGAACAGCGTGZ^AGATCCCACCGACGATCCGTCATAAGGTCGGCTTCATGTCACTGCATCT 

CCTGTGA 



Amino acid sequence (Seq. ID No. 20) 

MEAIAKRLDA CQEQLLELYE ENSTDLHKHV LHWKCMRHES VLLYKAKQMG LSHIGMQWP 
PLKVSEAKGH NAIEMQMHLE SLLRTEYSME PWTLQETSYE MWQTPPKRCF AKRGKTVEVK 
FDGCANNTMD YVVWTDVYVQ DNDTWVKVHS MVDAKGIYYT CGQFKTYYW FVKEAEKYGS 
TKHWEVCYGS TVICSPASVS STTQEVSIPE STTYTPAQTS TLVSSSTKED AVQTPPRKRA 
RGVQQSPCMA LCVAHIGPVD SGNHNLITMN HDQHQRRNNS NSSATPIVQP QGESNCLKCF 
R Y RLM DKHKH &FDLISSTWH WASSKAPHKH AIVTVTYDSE EQRQQFLDW KIPPTISHKL 
GFMSLHLLMA DDSGTENEGS GCTGWFMVEA IVQHPTGTQI SDDEDEEVED SGYDMVDFID 
DSNITHNSLE AQALFNRQEA DTHYATVQDL GGKYLGSPYV SPINTIAEAV ESEISPRLDA 
IKLTRQPKKV KRRLFQTREL TDSGYGYSEV EAGTGTQVEK HGVPENGGDG QEKDTGRDIE 
GEEHTEAEAP TMSVREHAGT AGILELLKCK DLRAALLGKF KECFGLSFID LIRPFKSDKT 
TCLDV7WAGF GIHHSISEAF QKLIEPLSLY AHIQWLTNAW GMVLLVLLRF KVNKSRSTVA 
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RTLATLLNI ? ENQMLIEPPK IQSGVAALYW FRTGISNAST VIGEAPEWIT RQTVIEHGLA 
DSQFKLTEMV QWAYDNDICE ESEIAFEYAQ RGDFDSNARA FLNSNMQAKY VKDCATMCRH 
YKHAEMRKMS IKQWIKHRGS KIEGTGNWKP IVQFLRHQNI EFIPFLTKFK LWLHGTPKKN 
CIAIVGPPDT DKSYFCMSLI SFLGGTVISH WSSSHFWLQ PLVDAKVALL DDATQPGWIY 
MDTYMRNLLD GNPKSIDRKH KALTLIKCPP LLVTSMIDIT KEDKYKYLHT RVTTFTFPNP 
FPFDRNGNAV YEIiSNTNWKC FFERLSSSLD IQDSEDEEDG SNSQAFRCVP GTWRTLMEA 
IAKRLDACQD QLLELYEENS IDIHKHIMHW KCIRLESVLL HKAKQMGLSH IGLQWPPLT 
VSETKGHNAI EMQMHLESLA KTQYGVEPWT LQDTSYEMWL TPPKRCFAKQ GNTVEVKFDG 
CEDNVMEYW WTHIYLQDND SWVKVTSSVD AKGIYYTCGQ FKTYYVEFFNK EAQKYGSTNH 
WEVCYGSTVJ CSPASVSSTV REVSIAEPTT YTPAQTTAPT VSACTTEDGV SAPPRKRARG 
PSTNWa'LCVA NJRSVDSTIN NIVTDNYNKH QRRNNCHSAA TPIVQLQGDS NCLKCFRYRL 
NDKYKHLFEL ASSTWHWASP EAPHKNAIVT LTYSSEEQRQ QFLNSVKIPP TIRHKVGFMS 
LHLL 



11. Plasmid: HPV118 (p7313ie6be2.He2.6be1) 
Gene of interest: 

The gene for the polyprotein in construct HPV118 is a triple fusion protein 
comprised in order of 6be2, 11e2, 6be1 all codon optimised and mutated. The 
polyprotein gene was assembled by PCR from using 2 previous PCR fragments; 
6be1 and 11/6be2. The size of the gene is ~4.1kb, producing a polyprotein of 
~170kD,observed by PAGE and Western blot. 

Cloning: 

The polyprotein gene was digested with Bam HI + Not I restriction enzymes and 
ligated into p7313ie vector. Sequencing analysis of selected clones had 
indicated the 'odd' base change, but this was overcome by various fragment 
swapping. A resulting clone hpv118 #3 was found to have no errors. 

Polyprotein sequence in HPV118 (Seq. ID No. 21) 
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ATGGAAGCTATTGCCAAGCGACTGGACGCCTGCCAGGAGCAGCTGCTGGAGCTGTACGAGGAAAACAG 

CACAGACCTCCACAAGCACGTGCTGCACTGGAAGTGCATGCGCCACGAGTCAGTGCTCCTGTACAAGG 

CCAAGCAGATGGGGCTGTCCCACATCGGGATGCAGGTCGTGCCCCCGCTGAAGGTGAGCGAAGCCAAG 

GGCCACAACGCTATCGAGATGCAGATGCACCTGGAGAGCCTGCTGCGGACCGAATACAGCATGGAGCC 

CTGGACTCTCCAGGAGACGTCCTACGAAATGTGGCAGACTCCTCCGAAGCGCTGTTTCGCAAAGCGCG 

GCAAGACAGTTGAGGTGAAATTCGATGGGTGCGCAAACAACACGATGGACTACGTGGTGTGGACCGAT 

GTCTACGTGCAGGACAATGACACCTGGGTGAAGGTACATAGTATGGTGGATGCCAAGGGCATCTATTA 

CACCTGCGGGCAGTTCAAGACGTACTACGTCAACTTCGTCAAGGAAGCCGAAAAGTATGGTTCCACCA 

AGCACTGGGAGGTGTOTTACGGGAGTACTGTGATCTGCAGCCCCGCCTCCGTGTCGTCCACCACCCAG 

GAAGTGAGCATTOCGGAGAGCACCACATACACCCCGGCCCAAACGAGCACGCTCGTCAGCAGCAGCAC 

CAAGGAGGACGCCGTCCAGACGCCCCCCCGGAAGAGGGCCCGGGGGGTCCAGCAGTCTCCCTGCAATG 

CCCTGTGCGTTGCTCACATCGGCCCTGTCGATTCTGGGAACCACAATCTCATCACGAACAACCACGAC 

CAGCACCAAAGGCGCAACAACTCTAACAGCTCCGCAACTCCAATAGTGCAGTTCCAGGGGGAGTCCAA 

CTGCCTCAAGTGTTTCCGCTACCGCCTCAACGACCGCCACCGCCACCTGTTCGACTTGATCAGTTCCA 

CG-GGCACTGGGCCAGCAGCAAGGCGCCCCACAAACACGCTATCGTGACGGTGACCTACGACTCCGAG 

GAGCAGAGGCiiGCAG'X'TCCTGGACGTCGTGAAGATTCCTCCGACAATCAGCCACAAGCTTGGCTTCAT 

GTCCCTGC^CCTGCTGATGGAAGCCATCGCGAAGAGGCTCGACGCCTGCCAGGACCAGCTGCTCGAGC 

TGTACGAGGAGAACAGCATTGACATCCATAAGCACATCATGCACTGGAAGTGCATTCGCCTGGAGAGC 

GTGCTGTTGCACAAGGCCAAGCAGATGGGCCTGTCCCACATAGGCCTTCAGGTGGTCCCCCCTCTGAC 

CGTGTCAGAGACAAAGGGCCATAACGCAATCGAGATGCAGATGCACCTCGAGTCGCTGGCGAAAACAC 

AGTACGGCGTGGAGCCATGGACCCTGCAGGACACCTCGTACGAAATGTGGCTGACCCCACCTAAGCGA 

TGCTTCGCCAAACAGGGCAACACAGTGGAGGTGAAGTTCGACGGCTGTGAGGATAACGTTATGGAGTA 

TGTCGTGTGGACGCACATCTATCTGCAGGACAACGACAGTTGGGTGAAGGTGACCAGCTCCGTGGACG 

CGAAGGGCATCTACTATACCTGTGGGCAGTTTAAAACCTACTATGTGAACTTCAACAAAGAGGCCCAA 

AAGTATGGCTCCACCAACCACTGGGAGGTCTGCTATGGGAGCACGGTGATTTGCTCTCCCGCCAGCGT 

GTCTAGCACTGTGCGCGAGGTGAGCATTGCCGAGCCGACCACGTACACCCCTGCCCAGACGACCGCTC 

CGACCGTGTCTGCTTGTACTACCGAGGACGGCGTGAGCGCTCCACCCAGGAAGCGTGCGAGGGGCCCA 

AGCAC'CAAC'AACAC'CC'TCTGTGTGGCGAACATTCGCAGCGTCGACAGTACCATCAATAACATCGTGAC 

GGATAACTATAACAAGCACCAGAGGCGTAACAACTGTCACTCTGCCGCAACCCCCATCGTGCAGCTCC 

agggagacagcaattgccttaagtgcttccgctatcgcctcaacgacaagtacaagcacctctttgag 

CTCGCCTCGTCGACGTGGCACTGGGCCTCACCCGAGGCACCTCACAAGAACGCCATCGTCACTCTCAC 

ttactccagtgaggagcagagacagcagtttctgaacagcgtgaagatcccaccgacgatccgtcata 

AGGTCGGCTTCAXGTCACTGCATCTCCTGATGGCAGACGATTCCGGTACTGAGAACGAAGGTTCTGGT 

tgtaccggttggttcatggttgaagcaatcgttcagcatccgactggtacccagatctccgatgacga 
i agacgaagaagttgaagattctggttacgacatggttgacttcatcgatgactgcaacatcagtcata 
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ACTCTCTGGAAGCACAGGCTCTGTTTAACCOCCAGGAAGCTGATACCCATTACGCTACTGTTCAGGAC 

CTGGGAGGCAAATATCTGGGCTCTCCGTACGTTTCCCCGATCAACACTATCGCAGAAGCAGTTGAGTC 

TGAAATCTCCCCGCGCCTGGACGCTATCAAACTGACTCGTCAGCCGAAGAAGGTTAAACGTCGTCTGT 

TCCAGACTCGTGAACTGACCGACTCCGGTTACGGTTATAGCGAAGTTGAGGCTGGCACCGGCACCCAG 

GTTGAAAAACACGGTGTACCGGAAAACGGCGGCGACGGTCAGGAAAAGGACACCGGCCGCGACATCGA 

GGGTGAGGAACACACCGAAGCTGAAGCTCCGACTAACTCTGTTCGTGAACACGCAGGTACTGCGGGTA 

TCCTGGAACTGCTGAAATGCAAAGACCTGCGCGCGGCTCTGCTGGGCAAATTCAAAGAATGCTTCGGC 

CTGTCTTTCATTGACCTGATCCGTCCGTTTAAGTCTGACAAAACTACCTGTCTGGACTGGGTTGTAGC 

AGGCTTCGGCATCGACCACTCTATCTCTGAAGCATTCCAGAAACTGATCGAGCCGCTGTCTCTGTACG 

CGCACATCCA.GTGGCTGACTAACGCTTGGGGTATGGTTCTGCTGGTACTGCTGCGCTTTAAAGTAAAC 

AAATCTCGTTCCACTGTTGCTCGTACTCTGGCTACCCTGCTGAACATCCCGGAGAACCAGATGCTGAT 

CGAACCGCCGAAAATCCAGTCTGGTGTAGCTGCACTGTACTGGTTTCGTACTGGCATCTCTAACGCTA 

GCACTGTTATCGGTGAAGCACCGGAATGGATCACTCGTCAGACCGTTATCGAACACGGTCTGGCAGAT 

TCTCAGTTCAAACTGACTGAAATGGTTCAGTGGGCATACGACAACGACATCTGCGAGGAATCTGAAAT 

TGCGTTCGAATACGCTCAGCGTGGCGACTTCGACTCCAACGCTCGTGCTTTCCTGAACAGCAACATGC 

AGGCTAAATACGTAAAAGACTGCGCTACCATGTGCCGTCACTACAAACACGCGGAAATGCGTAAAATG 

TCTATCAAACAGTGGATCAAGCACCGCGGTTCTAAAATCGAAGGTACCGGTAACTGGAAACCGATCGT 

TCAGTTCCTGCGCCATCAGAACATCGAATTCATCCCGTTCCTGACCAAATTCAAGCTGTGGCTGCACG 

GTACCCCGAAAAAAAACTGCATCGCTATCGTAGGTCCACCGGACACTGACAAGTCTTACTTCTGTATG 

TCCCTGATCTCTTTCCTGGGCGGCACTGTAATCTCTCACGTTAACTCTTCCTCCCATTTCTGGCTGCA 

GCCACTG^TAGACGCGAAAGTAGCTCTGCTGGACGACGCGACCCAGCCGTGCTGGATCTACATGGATA 

CTTACATGCGCAACCTGCXGGACGGTAACCCGATGTCTATCGACCGTAAACACAAAGCGCTGACTCTG 

ATCAAGTGCCCGCCGCTGCTGGTAACTTCTAACATCGACATCACCAAGGAAGATAAATACAAGTACCT 

GCATACCCGTGTTACTACCTTTACTTTCCCGAACCCGTTCCCGTTTGATCGTAACGGTAACGCTGTTT 

ACGAACTGTCCAACACTAACTGGAAATGCTTCTTCGAGCGTCTGTCTTCCTCCCTGGACATCCAGGAC 

TCTGAAGATGAAGAAGATGGTTCTAACTCTCAGGCTTTCCGTTGTGTTCCGGGTACTGTTGTTCGTAC 

TCTGTGA 

Amino acid sequence (Seq. ID No. 22) 

MEAIAKRLDA CQEQLLELYE ENSTDLHKHV LHWKCMRHES VLLYKAKQMG LSHIGMQWP 
PLKVSEAKGH NAISMQMHLE SLLRTBYSME PWTLQETSYE MWQTPPKRCF AKRGKTVEVK 
FDGCANNTMD YWWTDVYVQ DNDTWVKVHS MVDAKGIYYT CGQFKTYYVN FVKEAEKYGS 
TKHWEVCYGS TVIC3PASVS STTQEVSIPE STTYTPAQTS TLVSSSTKED AVQTP PRKRA 
RGVQQSPCWA LCVAHIGPVD SGNHNLITNN HDQHQRRNNS NSSATPIVQF QGESNCLKCF 
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RYRLNDRHRH LFDLISSTWH WASSKAPHKH AIVTVTYDSE EQRQQFLDW KIPPTISHKL 
GFMSLHI.il.Mf AlAKHLDACQ DQLLELYEEN SIDIHKHIMH WKCIRLESVL LHKAKQMGLS 
HIGLQVVWL TVSETKGHNA IEMQMHLESL AKTQYGVEPW TLQDTSYEMW LTPPKRCFAK 
QGNTVEVKFD GCEDNVMEYV WTHIYLQDN DSWVKVTSSV DAKGIYYTCG QFKTYYVNFN 
5 KEAQKYGSTN HWBVCYGSTV ICSPASVSST VREV3IAEPT TYTPAQTTAP TVSACTTEDG 

VSAPPRKRAR GPSTMNTLCV ANIRSVDSTI NNIVTDNYNK HQRRNNCHSA ATPIVQLQGD 
SNCI.KCFRYR LNDKYKHLFE LASSTWHWAS PEAPHKNAIV TLTYSSEEQR QQFLNSVKIP 
PTIRHKVGFM SIrHLLMADDS GTENEGSGCT GWFMVEAIVQ HPTGTQISDD EDEEVEDSGY 
DMVDFIDDSN ITHNSLBAQA LFNRQEADTH YATVQDLGGK YLGSPYVSPI NTIAEAVESE 
10 ISPRLDAIKL TRQPKKVKRR LFQTRELTDS GYGYSEVEAG TGTQVEKHGV PENGGDGQEK 

DTGRDIEGEE HTEAEAPTNS VREHAGTAGI LELLKCKDLR AALLGKFKEC FGLSFIDLIR 
PFKSDKTTCL DWWAGFGIH HSISEAFQKL IEPLSLYAHI QWLTNAWGMV LLVLLRFKVN 
KSRSTVARTL ATLLNIPENQ MLIEPPKIQS GVAALYWFRT GISNASTVIG EAPEWITRQT 
VIEHGLADSQ FKLTEMVQWA YDNDICEESE IAFEYAQRGD fdsnarafln snmqakyvkd 

15 catmcrhykh aemrkmsikq wikhrgskie gtgnwkpivq flrhqniefi pfltkfklwl 

HGTPKKNCIA IVGPPDTDKS YFCMSLISFL GGTVISHVNS SSHFWLQPLV DAKVALIiDDA 
TQPCWIYMDT YMRNLLDGNP MSIDRKHKAL TLI#CPPLLV TSNIDITKED KYKYLHTRVT 
TFTFPNPFPF DRNGNAVYEL SNTNWKCFFE RLSSSLDIQD SEDEEDGSNS QAFRCVPGTV 

VRTL 

20 

The ColE1 cer sequence was obtained from a subclone from plasmid pDAH212 
from. David Hodgeson (Warwick University) and amplified by PCR using, primers 
to place EcoRI restriction sites at the ends of the sequence. The cer sequence 
25 was then inserted into the EcoRI site of p7313-PL to produce plasmid p7313- 
PLc, Tue sequence of the amplified cer was verified against the Genbank entry 
M11411. 

Example 2 - Expression in mammalian 293T cells 

30 

Mammalian 293T cells were grown at log phase at a final concentration of 2 
X10 5 cells per 6 well Corning Costar™ (Corning Science Products, 10 The 
ValleyCentre, Gordon Road, High Wycombe, Bucks, UK)tissue culture plate 
overnight at 37°C in 5%C0 2 . The following transfection mix was prepared and 

35 complexed for 25 minutes: 
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DNA of Interest 2 ng 

Made up with sterile double distilled water 16^1 

OPTI-mem™ (Gibco BRL, Paisley, Scotland) 8|il 

Lipofectamine™ (GibcoBRL) 6ul. 



Each cell monolayer in a well was washed carefully twice with OPTI-mem 
800(11 of OPTI-mem™ was added to each well. 200^1 of OPTI-mem™ was 

10 added to each transfection mix, mixed and added gently to a cell monolayer. 
The plate was incubated for 5 hours at 37°C in 5% C0 2 after which the 
transfection mix and OPTI-mem™ were discarded. The cell monolayers were 
washed gently with cell growth medium twice and finally transfected cells were 
incubated for 24 hours in Dulbecco's Modified Eagle Medium containing 10% 

15 foetal calf serum and 29.2mg/ml of L-glutamine at 37°C in 5% C0 2 . The cells 
were scraped off into microtubes, washed twice with PBS, spun down and the 
cell pellet was resuspended in SDS Page Laemmli dye. The cell pellets were 
boiled and loaded onto a 10%SDS Page gel, electrophoresed in 1X Tris Glycine 
SDS buffer. After electrophoresis, the gel was blotted onto Nitrocellulose 

20 membrane (Amersham) and Western Blotted. The nitrocellulose membrane was 
blocked with 5% Marvel™ (Premier Beverages, Knighton, Adbaston, Stafford, 
UK) in PBS for 30 min at room temperature and washed twice with PBS and 
0.1% Tween 20. A polyclonal antibody raised against the C terminal protein 
sequence of KPVGbEl (protein sequence: CSSSLDIQDSEDEEDGSNSQAFR 

25 Seq. ID No. 23) in rabbits, was diluted in 5% Man/el™ in PBS and added to the 
nitrocellulose membrane. This was incubated at room temperature for 1 hour 
with gentle agitation. A polyclonal antibody against HPV11E1 was also used to 
check cross reactivity. The diluted antibody was removed and the membrane 
washed three times with PBS and 0.1% Tween 20. A secondary conjugate, 

30 Swine anti-rabbit horseradish peroxidase (HRP) (DAKO), was diluted 1 :20000 in 
PBS and 0.1% Tween 20. This was added to the washed membrane and 
incubated with gentle agitation at room temperature for 1 hour. The membrane 
was then washed thoroughly with PBS and 0.1%Tween20. A Chemiluminescent 
HRP kit (Amersham) was used to detect the transferred proteins on the 

35 membrane. 
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Results: 

The results (Fig. 13) show a correct protein size expressed by each of HPV 116, 
117, 118 containing the codon optimised HPV polyproteins. 

5 

HEK293T cells were transfected with ~0.5ug DNA of the respective constructs 
and the cells harvested 24hrs later. These samples were then analysed by first 
polyacrylamide electrophoresis and then Western blotting. Two peptide antibodies 
10 were used to detect for polyprotein expression (~180kd); Anti-6bE1(no.1097) 

and anti-6bE2 (no.1101). 



Example 3 

15 E1 antigen inactivation and experimental confirmation 

Tne HPV E1 protein is a well conserved nuclear protein with non-specific DNA 
binding, ATPase and helicase activities. E1 also binds to host cellular DNA 
polymerase-a primase and, to the HPV E2 protein which then 'recruits' E1 into 
20 the pre-initiation viral DNA replication complex. The primary role of E1 is to 
initiate virus specific DNA replication in infected cells. 

The DNA replication functions of E1 (and E2) are relatively non-specific and 
many studies have now shown that the E1 and E2 proteins from one genotype 

25 can drive the origin specific DNA replication of a plasmid carrying the replication 
origin sequence from a different genotype. Studies have also shown that the 
introduction of highly expressed E1 and E2 into cells already harbouring low 
copy number HPV plasmid can result in a significant amplification of that 
plasmid. This promiscuity carries with it a small potential safety risk which the 

30 project sought to eliminate. Consequently, mutations in E1 (and E2) which 
inactivate their replication potential were sought. 

The E1 mutation G482D occurs in a highly conserved ATP binding consensus 
sequence and E1 protein carrying this mutation has been shown to have multiple 
35 functional deficits. Other mutations, towards the N-terminus of the protein 
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(K83G, R84G) have been shown to abrogate nuclear localisation of E1. Failure 
to locate to the nuclear compartment would also serve to separate E1 from host 
replication proteins and viral DNA, providing an additional level of incapacity and 
safety. These mutations (G428D, K83G, R84G) were selected and incorporated 
5 into E i as pari of the HPV DNA immunotherapeutic E1 vector. 

An in vitro HPV DNA replication assay was used to confirm disablement of the 
DNA replication functions of E1 (as a corollary the mutational inactivation of the 
replication enhancing activity of E2 could also be confirmed in this same assay). 

1 0 Briefly, both El and E2 co-operatively activate the HPV origin of replication and 
the E l and E2 proteins from HPV 6b ware known to activate and drive de novo 
DNA replication from the HPV-11 origin. Plasmids encoding our codon- 
optimised E1 and E2 sequences were co-transfected into 293 cells with a 
plasmid carrying the HPV-1 1 origin of replication (ori plasmid). E1 and E2 

1 5 dependent replication of the input ori plasmid is measured by harvesting DNA 

from cells 48 hours after co-transfection (Hirt lysis). Extracted DNA is restriction 
enzyme digested first with Hind III and then Dpn I which digests unmethylated 
unreplicated DNA. DNA's are then southern blotted and hybridised with ori 
plasmid DNA as probe. Bands with a size equivalent to ori plasmid after Dpnl 

20 digestion are markers for de novo in vitro replicated plasmid DNA. 

Wild type E1 and E2 (HPV 1 19 + HPV 120) show a strong band indicative of 
replicated input plamsid DNA. Each of the three lead constructs are negative, 
(HPV1 16, HPV117 and HPV118) showing results; no replication. 



25 



30 



Conclusion: The lead constructs HPV 116, HPV 117 and HPV 118 have no 

DNA replication activity. 

Example 4 



The E2 protein of papillomaviruses is a site-specific DNA binding nuclear protein 
functioning as the primary replication origin recognition protein and assists in the 
assembly of the pre-initiation DNA replication complex. Full length E2 protein 
can also act as either a repressor or activator of viral. transcription depending 
35 upon the position (relative to other transcription factor sites), and the affinity of 
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the protein for its cognate binding site. E2 is also known to influence the 
transcription of several host cellular promoters. The mutational inactivation of 
E2 has been studied extensively and one point mutation in particular Lys 111 -> 
Ala (K1 1 1 A) has been shown to inactivate both the transcriptional and replication 
functions of E2. This mutation may also have the addition benefit of preventing 
nuclear translocation of the protein. This mutation (K111A) was incorporated 
into each E2 antigen as part of the HPV DNA immunotherapeutic. 

We set out to wnflrm the incapacity of K111A mutated E2 and each polyprotein 
construct in an in vitro CAT transcriptional reporter assay. We used two positive 
controls (sources of active E2 protein). These were a construct expressing 
unmutated (active) HPV-11 E2 protein, and a second vector expressing BPV E2 
protein, a strong transcriptional transactivator. These data are shown in figure 
14. 

Conclusion: These data show that protein expressed from the native 
(unmutated) HPV 6b E2 vector is transcriptionally active, whilst mutated (K1 1 1 A) 
E2 is inactive, as are each of the polyprotein vectors HPV 1 16, 117 and 118. 

Example 5 

Expression of and Comparison with Individual Gene Constructs HPV 116, 
HPV 117 and HPV 118. 

Gene expression studies comparing the leads constructs HPV 116, HPV 117 
and HPV 118 failed to identify any clear differences in in vitro gene expression. 
In addition, expression of the polyprotein was equivalent to expression of the 
individual (unfused) antigen in a single plasmid (HPV 110). Equally important, 
the introduction of the point mutations did not impact on gene expression (HPV 
108 and HPV 110). 

Example 6 

In Vivo nnmu.wyeriicity Studies in Mice 
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In order to compare the immunogenic^ of the three different constructs HPV 
116, HPV 1 17 and HPV 118 In vivo, mice were immunised using PMID. 

Each immunisation comprised two shots of 0.5 DNA fired into the shaved 
abdomen of Balb/c (H-2K d ) or C57 BL6 (H-2K b ) mice. Animals were primed with 
1ng DNA, boosted 21 days later with an equivalent dose and culled 5-7 days 
post boost. Sera and spleens were taken for analysis of the humoral and cellular 

immune response generated following PMID. 

Humoral Assays 

Antibodies raised in PMID immunised mice were evaluated using standard 
ELISA methods and recombinant E1 and E2 protein as capture antigen. 
Antibody responses could not be reliably detected except after extended 
immunisation schedules in E2 immunised mice. We did not confirm detection of 
antibody to the E1 antigen in mice. These weak/undetectable antibody 
responses are in keeping with the published literature. 

Cellular Assays 

ELISPOT assays were used to study cellular immune responses in mice. This 
technique is suitable for assessing the frequency of cells within a culture of 
known density that are capable of secreting cytokines specifically in response to 
antigen presented in the context of syngeneic MHC molecules. 

Briefly, a single cell suspension of splenocytes isolated from immunised animals 
is added to specialised microtitre plates coated with anti-cytokine capture 
antibody and incubated overnight in the presence of antigen presented by 
suitable target cells. Cytokine is captured by antibody bound to the plate in the 
area directly around the cell and this remains bound when cells are lysed and 
washed away. Detection is achieved by use of a biotinylated secondary anti- 
cytokine antibody and a streptavidin alkaline phosphatase conjugate. The action 
of this enzyme on a chromophoric substrate allows visualisation of the frequency 
of cytokine producing cells. 
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Vaccinia ELISPOT Assays and Data 

Due to the absence of defined murine T cell epitopes, antigen was provided in 
the form of recombinant vaccinia viruses engineered to express target antigens. 
5 Such viruses were used to infect appropriate target cells for the presentation of 
antigen to effector cells in ELISPOT assays. 

Responses to HPV 6bE1 were detected following PMID of the three candidate 
constructs to C57BL/6 mice. The results of 2 separate experiments were 
10 analysed statistically. The results of a representative experiment are shown in 

the figure 15 and 16. 

Illustrative Immunogenicity data using lead constructs and PMID in mice: 
15 CTL Assays and Data 

Activated CD8+ T cells are able to lyse cells in response to specific peptide 
presented in the context of syngeneic MHC I molecules. This function can be 
determined by Eu3+ release bioassay, a non-radioactive modification of the 
20 traditional chromium release assay. 

Use of this assay for these purposes required the identification of a CD8+ T cell 
epitope derived from the primary sequence of the HPV 6bE1 protein. This was 
achieved by screening a peptide library consisting of 15-mers overlapping by 11 
25 using cytokine ELISPOT. Responding populations were identified as CD4+ or 
CD8+ T cells by standard flow techniques. 

The basis of this technique involves lysis of Eu3+ labeled target cells pulsed with 
cognate peptide, During the course of a two hour incubation, Eu3+ is released 
30 into the culture supernatant upon lysis of target cells by cytolytic T cells. This is 
detected by time-resolved fluorimetry. Specific lysis is expressed as a 
percentage of the total amount of lysis detected when target cells are lysed by 
chemical means. 



35 
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Assessment of cellular immunology data 

The immunologic evaluation of HPV 116, HPV 117 and HPV 118, comprised 
repeat PMID immunisation studies in mice with Vaccinia ELISPOT and CTL 
5 assay analysis as immunologic outputs. All candidates raised a strong immune 

response to e:;-u.h antigen. 

Collectively, the vaccinia ELISPOT data show that responses to E1 are not 
compromised by mutation or by fusion to the E2 antigen components. When 

10 comparing E1 responses between HPV-108 (single 6b E1 construct). HPV 116, 
HPV 1 1 7 and HPV 1 1 8 the responses are not statistically different. Vaccinia 
ELISPOT data do however reveal a difference in responses to the HPV-11 E2 
antigen component. E2 antigen specific responses are significantly greater in 
mice immunised with HPV 1 1 8 than in mice immunised with HPV 1 16 or HPV 

15 117. On this basis alone HPV 118 appears to be a superior immunogen than 

HPV 116 or HPV 117. 

The analysis of El antigen specific CTL lysis also revealed a trend in potency. 
The percentage specific lysis was higher using T-cells form HPV 1 18 immunised 
20 mice than with either of HPV 1 16 or HPV 117. This observation is reproducible. 

Taken together, and on the basis of both vaccinia ELISOT and CTL lysis data, 

HPV 118 is the stronger immunogen. 

25 Conclusion. On purely immunologial criteria construct HPV 1 18 is the most 
immunogenic of the polyproteins. 

Example 7 



30 



PMID delivery of codon-optimised COPV E1/E2 fusion protein is more 
effective in protecting against canine oral papillomavirus disease thar 
either codon-optimised E1 or codon-optimised E2 alone. 
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Introduction 

The canine oral papillomavirus (COPV) animal model is a good mimic of 
mucosal human papillomavirus disease. The features of disease caused in dogs 
by COPV are very similar to that which occurs in humans (Nicholls et al Virology 
2001 , 283(1) 31-39). Importantly it is a mucosal papillomavirus disease model. 
The COPV virus infects the canine mucosal epithelia and, after a lag period of a 
few weeks warts appear which then regress spontaneously after an additional 
period of some weeks.. The COPV virus encodes homologues of each of the 
human papillomavirus genes (E1, E2, E4, E6, E7, L1 and L2). 

The dog COPV mucosal disease model has previously been used as a key 
model in developing the rationale for human virus-like-particle (VLP) 
papillomavirus vaccines (Ghim et al, Vaccines 1995 25, 375-379, Suzich et al, 
PNAS 1 995, 92 1 1 553-1 1 557). Human papillomavirus VLP vaccines are now in 
development, and early stage clinical trials have recently been completed in 
humans. . . 

We show that plasmid DNA encoding a codon-optimised fusion of E1 and E2 
genes when administered by PMID reduces disease burden more effectively 
than either than either a plasmid encoding codon-optimised E1 orcodon- 

optimiso E2 alone. 

Methods 

Construction of the codon-optimised E2/E1 fusion vector 

A synthetic gene encoding a codon-optimised COPV E2 sequence was 
generated using methods described previously. This was fused to the synthetic 
codon-optimised COPV E1 gene recovered from clone pCOPVEl c/o and 
inserted into vector WRG7077 to generate a new clone which was designated 
pCOPVE2/E1 c/o. This clone expresses a polyprotein comprising a fusion of 
COPV E2 (N terminal) and COPV E1 (C terminal). The polyprotein is of the 
expected size as determined by western blotting. 
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Immunisation of Beagle Dogs with pCOPVEl c/o, PC OPVE2 c/o. and 
PCOPVE2/E1 c/o 

Beagle dogs were immunised by PMID with each of three purified plasmids 
5 pCOPVEl c/o, pCOPVE2 c/o and, pCOPV E2/E1 c/o. Animal were immunised 
at 12 cutaneous sites, 6 non-overlapping sites on each side of the abdominal 
midline. All vaccinations were performed under general anesthesia. There were 
five animals in each group. Six weeks after the first vaccination, a boosting 
vaccination was undertaken in an identical manner, using the same procedure. 

10 

Immunised animals were challenged with infectious COPV virus 2 weeks after 
the final boosting immunisation. The mucosa of the upper lip of each animal 
was lightly scarified. 10pl of purified COPV virus preparation was applied to 
each of ten sites (five on each side of the upper lip) and allowed to absorb for a 
1 5 few minute!;, i ne isolation and purification of infectious COPV virus has been 
described (Virology 1999, 265 (2) 365-374). 

After challenge with COPV virus the sites of mucosal challenge were examined 
weekly. The time (after challenge) of wart (papilloma) appearance, and wart 

20 size (mm) was measured. 

In animals immunised with pCOPVEl c/o papillomas developed at the mucosal 
challenge sites beginning at week 7 after challenge. Papillomas continued to 
grow in size reaching a mean size of >3.5 mm by week 1 1 . In animals 

25 immunised with pCOPV E2 c/o papilloma's first appeared at week 8 but and the 
mean papilloma size reached 1 .5 mm at week 1 1 . In animals immunised with 
pCOPVE2/E1 c/o whilst the first signs of disease are co-incident with that of the 
other groups the overall disease burden is significantly reduced. One animal (of 
five) in the pCOPVE2/E1 c/o group was fully protected from disease 

30 development whilst all other animals in the group developed only very small 
papilloma's which regressed in a short period (1-2 weeks). 
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Plasmid DNA encoding a fusion of COPV E1 and COPV E2 are more effective 
than either of COPV E1 or COPV E2 in preventing disease development in this 
animal rnociel of papillomavirus infection. (Figure 18) 
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